Abstract

Finite unit norm tight frames provide Parseval-like decompositions of vectors in terms of redundant components of 
equal weight. They are known to be exceptionally robust against additive noise and erasures, and as such, have great 
potential as encoding schemes. Unfortunately, up to this point, these frames have proven notoriously difficult to 
construct. Indeed, though the set of all unit norm tight frames, modulo rotations, is known to contain manifolds of 
nontrivial dimension, we have but a small finite number of known constructions of such frames. In this paper, we 
present a new iterative algorithm — gradient descent of the frame potential — for increasing the degree of tightness of 
any finite unit norm frame. The algorithm itself is trivial to implement, and it preserves certain group structures present 
in the initial frame. In the special case where the number of frame elements is relatively prime to the dimension of the 
underlying space, we show that this algorithm converges to a unit norm tight frame at a linear rate, provided the initial 
unit norm frame is already sufficiently close to being tight. By slightly modifying this approach, we get a similar, but 
weaker, result in the non-relatively-prime case, providing an explicit answer to the Paulsen problem: "How close is a 
frame which is almost tight and almost unit norm to some unit norm tight frame?" 
1. Introduction

Frames provide numerically stable methods for finding overcomplete decompositions of vectors, and are ubiquitous
in signal processing applications [16, 17]. As explained below, tight frames and unit norm frames are particularly
useful. However, it is difficult to construct frames which possess both of these properties simultaneously, called unit
norm tight frames (UNTFs). In this paper, we present a new method for overcoming this difficulty, namely an iterative
procedure which, when applied to a given finite unit norm frame, asymptotically produces a UNTF. To be
precise, under the additional assumptions that the number of frame vectors is relatively prime to the dimension of 
the underlying space and that our initial unit norm frame is sufficiently close to being tight, we are able to show that 
our method, namely a gradient descent of the frame potential, converges to a UNTF at a linear rate. That is, from a 
tightness perspective, our algorithm takes a good unit norm frame and makes it perfect. As such, it can be viewed as 
a frame-theoretic analog of Auto-Tune™ , the software commonly used in the music industry to perfect the pitch of 
lesser vocalists. Moreover, in the non-relatively-prime case, we can slightly modify our argument to yield an explicit
answer to the Paulsen problem [2]:

"How close is a frame which is almost tight and almost unit norm to some UNTF?" 

To make these notions precise, consider the synthesis operator of a sequence of vectors $F = \{f_n\}_{n=1}^N$ in a real or
complex $M$-dimensional Hilbert space $\mathbb{H}_M$, namely $F : \mathbb{H}_N \to \mathbb{H}_M$, $Fg := \sum_{n=1}^N g(n) f_n$. That is, viewing $\mathbb{H}_M$ as $\mathbb{R}^M$
or $\mathbb{C}^M$, $F$ is the $M \times N$ matrix whose columns are the $f_n$'s. Note that here and throughout, we make no notational
distinction between the vectors themselves and the synthesis operator they induce. The vectors $F$ are said to be a
frame for $\mathbb{H}_M$ if there exist frame bounds $0 < A \le B < \infty$ such that $A\|f\|^2 \le \|F^*f\|^2 \le B\|f\|^2$ for all $f \in \mathbb{H}_M$. In this
finite-dimensional setting, having $F$ be a frame is equivalent to having the $f_n$'s span $\mathbb{H}_M$, necessitating $M \le N$, with
the optimal frame bounds $A$ and $B$ corresponding to the least and greatest eigenvalues of $FF^*$. In particular, $F$ is a
tight frame when $A = B$, that is, when $FF^* = AI$. Tight frames are useful in applications, as they provide Parseval-like
decompositions

$$f = \frac{1}{A} F F^* f = \frac{1}{A} \sum_{n=1}^N \langle f, f_n \rangle f_n, \qquad \forall f \in \mathbb{H}_M, \tag{1}$$

despite the fact that the $f_n$'s are not required to be independent. Indeed, the tightness condition $FF^* = AI$ does not
require the columns of $F$, that is, the $f_n$'s, to be orthogonal, but rather, it requires the rows of $F$ to be orthogonal and
have equal norm $\sqrt{A}$. Meanwhile, $F$ is a unit norm frame when $\|f_n\| = 1$ for all $n = 1, \dots, N$. When a frame is both
unit norm and tight (a UNTF), it breaks vectors into possibly redundant components of equal weight (1), with the
tight frame constant $A$ being the redundancy $N/M$. UNTFs are known to be exceptionally robust against additive noise
and erasures |0,[I2,[T3l[3l. Unfortunately, UNTFs are also notoriously difficult to construct: we want MxN matrices 
$F$ that have unit norm columns and orthogonal rows of equal squared-norm $N/M$. To be clear, UNTFs are known to exist
for any M < N: one may either invoke the classical theory of majorization for matrices, or more simply, consider the 
harmonic frame obtained by truncating an N x N discrete Fourier transform (DFT) matrix 11211 . Another technique 
is to build an operator with a flat spectrum using weighted DFT blocks; this spectral tetris method yields extremely 
sparse UNTFs [6]. However, these techniques only produce certain examples of UNTFs, while the set of all UNTFs, 
modulo rotations, contains nontrivial manifolds whenever > M + 1 I Kill . That is, these methods produce but a few 
samples from the continuum. 
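The harmonic frame mentioned above is easy to check numerically. The following is a minimal NumPy sketch, not code from the paper: the function name `harmonic_frame` and the choice to keep the first $M$ rows of the DFT matrix are assumptions made for illustration (any $M$ distinct rows are mutually orthogonal as well).

```python
import numpy as np

def harmonic_frame(M, N):
    """M x N harmonic frame: first M rows of the N x N DFT matrix,
    scaled by 1/sqrt(M) so that every column has unit norm."""
    rows = np.arange(M).reshape(-1, 1)
    cols = np.arange(N).reshape(1, -1)
    return np.exp(2j * np.pi * rows * cols / N) / np.sqrt(M)

F = harmonic_frame(3, 7)
column_norms = np.linalg.norm(F, axis=0)   # unit norm columns
frame_operator = F @ F.conj().T            # equals (N/M) I for a UNTF
```

Here the rows of `F` are orthogonal with squared norm $N/M$, which is exactly the UNTF condition described in the text.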

In this paper, we provide a new method for starting with a given frame and producing a nearby UNTF from it. Such 
techniques are very useful in real-world problems, as they allow one to take a given transform, carefully crafted to 
have certain application-specific properties without being tight and/or unit norm, and to correct, or tune, its algebraic 
properties while changing the transform itself as little as possible. In terms of mathematics, these techniques are 
important because they help in solving the Paulsen problem. To be precise, a compactness argument of D. Hadwin [2]
shows that indeed, if a frame is sufficiently close to being both tight and unit norm, then it is, in fact, close to a 
UNTF. Current work on this problem therefore focuses on how close these UNTFs are, as well as developing practical 
schemes to obtain them. Unfortunately, finitely-iterative techniques using Givens rotations (H, 14] have, to this point, 
produced UNTFs that are not necessarily close to the originals. 

More recent approaches to solving the Paulsen problem, namely that of [2] and the present method, rely upon the
fact that given any frame $F$, it is straightforward to produce a unit norm frame from it: simply replace each $f_n$ with $f_n/\|f_n\|$.
Moreover, one can also convert any frame into a tight frame, provided one has the computational power to take the
inverse square root of the frame operator: consider $(FF^*)^{-1/2}F$. However, combining these two operations (dividing
by the root of the frame operator and then normalizing the resulting vectors, or vice versa) does not yield UNTFs, as
these two operations do not commute. Nevertheless, by using one of these two techniques, one may assume without
loss of generality [2] that either the initial frame is exactly tight and nearly unit norm or, alternatively, that the initial
frame is exactly unit norm and nearly tight. The former approach is that taken by [2]: starting with a tight frame that
is not unit norm, they solve a differential equation that minimizes frame energy while preserving tightness, flowing
towards a UNTF; this led to the first genuine solution to the Paulsen problem in the special case where $M$ and $N$ are
relatively prime. The latter approach is the one we pursue here.
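To make the non-commutativity of the two operations concrete, here is a small NumPy sketch. The helper names `normalize_columns` and `tighten`, and the $2 \times 3$ example matrix, are illustrative assumptions rather than anything taken from the paper; `tighten` forms $(FF^*)^{-1/2}F$ through an eigendecomposition of the frame operator.

```python
import numpy as np

def normalize_columns(F):
    """Replace each column f_n by f_n / ||f_n||."""
    return F / np.linalg.norm(F, axis=0, keepdims=True)

def tighten(F):
    """Return (F F^*)^{-1/2} F, assuming F F^* is invertible."""
    evals, evecs = np.linalg.eigh(F @ F.conj().T)
    inv_sqrt = (evecs / np.sqrt(evals)) @ evecs.conj().T
    return inv_sqrt @ F

F = np.array([[1.0, 0.0, 2.0],
              [0.0, 1.0, 0.0]])

A = normalize_columns(tighten(F))   # unit norm, but no longer tight
B = tighten(normalize_columns(F))   # tight, but no longer unit norm
```

For this particular `F`, the matrix `A` has frame operator $\mathrm{diag}(2, 1)$, while `B` has frame operator $I$ but column norms that differ from one another, so neither order of operations produces a UNTF.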

In particular, starting with a frame that is already unit norm, we try to produce a UNTF from it. Preliminary results 
to this end were reported in the conference proceedings paper . We accomplish this task by descending against the 
gradient of the frame potential, namely the square of the Hilbert-Schmidt norm of the Gram matrix $F^*F$, regarded as
a function over $N$ copies of the unit sphere $\mathbb{S}_M := \{f \in \mathbb{H}_M : \|f\| = 1\}$:

$$\mathrm{FP} : \mathbb{S}_M^N \to \mathbb{R}, \qquad \mathrm{FP}(F) = \|F^*F\|_{\mathrm{HS}}^2 = \sum_{n=1}^N \sum_{n'=1}^N |\langle f_n, f_{n'} \rangle|^2.$$

Introduced in [1], the frame potential is the total potential energy contained within a given collection of points on the
sphere under the action of a frame force which encourages orthogonality. As discussed in the next section, one can
show that $\mathrm{FP}(F) = \frac{N^2}{M} + \|FF^* - \frac{N}{M} I\|_{\mathrm{HS}}^2$ for any $F \in \mathbb{S}_M^N$. That is, the frame potential is bounded below by $\frac{N^2}{M}$, with
equality if and only if $F$ is a UNTF. The main result of [1] gives that even local minimizers of FP are UNTFs. As such,
even if no explicit constructions of such frames were known, they must exist: FP is a continuous function over the
compact set $\mathbb{S}_M^N$, and as such, possesses a global minimizer, which is necessarily a local minimizer, which is necessarily
a UNTF. This existence argument has been generalized to numerous other settings. Moreover,
this fact implies that every local minimizer of FP is necessarily a global minimizer, which is a nice property to have
when performing gradient descent; even here, this task is nontrivial however, as there are nonoptimal arrangements at 
which the first derivative of the frame potential vanishes |Q]]. 
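The frame potential and its lower bound $N^2/M$ can be checked directly. The sketch below is illustrative only: the helper names `frame_potential` and `random_unit_norm_frame`, the seed, and the frame sizes are assumptions chosen for the example.

```python
import numpy as np

def random_unit_norm_frame(M, N, seed=0):
    """Gaussian M x N matrix with columns rescaled to unit norm."""
    rng = np.random.default_rng(seed)
    F = rng.standard_normal((M, N))
    return F / np.linalg.norm(F, axis=0, keepdims=True)

def frame_potential(F):
    """FP(F) = ||F^* F||_HS^2, the sum of squared Gram matrix entries."""
    gram = F.conj().T @ F
    return float(np.sum(np.abs(gram) ** 2))

F = random_unit_norm_frame(3, 7)
M, N = F.shape
excess = frame_potential(F) - N**2 / M   # nonnegative; zero only for a UNTF
```

For any unit norm `F`, the quantity `excess` coincides with $\|FF^* - \frac{N}{M} I\|_{\mathrm{HS}}^2$, which is the identity stated in the text.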

The novelty and significance of our work is best gauged by contrasting it with the current state-of-the-art of the 
Paulsen problem: the technique of [2]. Both approaches give valid solutions to the Paulsen problem and have certain 
applications for which they are preferable to the other. Instead of assuming our frame is already tight and seeking to 
become increasingly unit norm [2], we assume we are already unit norm and seek tightness. Rather than needing to
solve a differential equation [2], we have an iterative, gradient-descent-based algorithm; our approach only becomes
a differential equation when the step size is forced arbitrarily small. While the relative primeness of $M$ and $N$ is an
important consideration in both methods, the technique of [2] is only guaranteed to converge in this case, while our
convergence argument generalizes to the non-relatively-prime case, albeit in a weaker form. Also, as shown below, 
our method preserves the group structure of certain UNTF constructions, such as Gabor frames and filter banks, 
whereas [2] does not. 

In the next section, we introduce the fundamental concepts needed to compute the gradient of the frame potential
(Theorem 2) and study its group invariance properties (Proposition 3). In Section 3, we find sufficient conditions that
guarantee that gradient descent of the frame potential converges to a UNTF at a linear rate (Theorem 6). In the fourth
and final section, we show that these sufficient conditions are indeed met provided $M$ and $N$ are relatively prime and
the initial frame is already sufficiently tight, yielding an answer to the Paulsen problem in this case (Corollary 8). We
further discuss how these arguments generalize to the non-relatively-prime case (TheoremfTTTi. 



2. The gradient of the frame potential 

In this section, we lay the groundwork for our approach to modify a given unit norm frame so as to decrease its
distance from tightness. As such, our first priority is to formally define this distance. Let $\{\lambda_m\}_{m=1}^M$ be the eigenvalues
of the frame operator $FF^*$ of some unit norm sequence $F = \{f_n\}_{n=1}^N$. Note that since

$$\sum_{m=1}^M \lambda_m = \mathrm{Tr}(FF^*) = \mathrm{Tr}(F^*F) = \sum_{n=1}^N \|f_n\|^2 = N,$$

the average value of these eigenvalues is $\frac{N}{M}$. Moreover, $F$ is a UNTF if and only if $FF^* = \frac{N}{M} I$, that is, if and only
if all the $\lambda_m$'s are equal to $\frac{N}{M}$. As such, in the past, the distance from tightness of a unit norm frame $F$ has usually
been defined as $\max_m |\lambda_m - \frac{N}{M}|$. However, as no closed-form expression for eigenvalues exists, we propose an
alternative measure of tightness, namely the 2-norm of the values $\{\lambda_m - \frac{N}{M}\}_{m=1}^M$:

$$\sum_{m=1}^M \left(\lambda_m - \tfrac{N}{M}\right)^2 = \left\|FF^* - \tfrac{N}{M} I\right\|_{\mathrm{HS}}^2 = \mathrm{Tr}[(FF^*)^2] - 2\tfrac{N}{M}\mathrm{Tr}(FF^*) + \tfrac{N^2}{M^2}\mathrm{Tr}(I) = \mathrm{FP}(F) - \tfrac{N^2}{M}. \tag{2}$$

In particular, we see that $\mathrm{FP}(F) \ge \frac{N^2}{M}$ with equality if and only if $F$ is a UNTF. It therefore makes sense to define
our notion of the distance from tightness of $F$ to be the easily computable quantity $\|FF^* - \frac{N}{M} I\|_{\mathrm{HS}} = (\mathrm{FP}(F) - \frac{N^2}{M})^{1/2}$.
Written in this language, the version of the Paulsen problem on which we focus is the following:



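Given positive integers $M$ and $N$, find possibly $(M, N)$-dependent constants $\delta$, $C$ and $\alpha$ such that given
any unit norm sequence $F$ such that $\|FF^* - \frac{N}{M} I\|_{\mathrm{HS}} \le \delta$, there necessarily exists a UNTF $\tilde{F}$ such that

$$\|\tilde{F} - F\|_{\mathrm{HS}} \le C \left\|FF^* - \tfrac{N}{M} I\right\|_{\mathrm{HS}}^{\alpha}. \tag{3}$$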
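The distance from tightness used in this problem statement can be evaluated either from the eigenvalues of $FF^*$ or from the frame potential, as in (2). The sketch below compares the two routes; the function names and the random test frame are assumptions made for illustration.

```python
import numpy as np

def tightness_from_eigenvalues(F):
    """2-norm of the values lambda_m - N/M, lambda_m the eigenvalues of F F^*."""
    M, N = F.shape
    lam = np.linalg.eigvalsh(F @ F.conj().T)
    return float(np.sqrt(np.sum((lam - N / M) ** 2)))

def tightness_from_potential(F):
    """(FP(F) - N^2/M)^(1/2), which needs no eigenvalue computation."""
    M, N = F.shape
    fp = np.sum(np.abs(F.conj().T @ F) ** 2)
    # Clip at zero to guard against round-off when F is (nearly) a UNTF.
    return float(np.sqrt(max(fp - N**2 / M, 0.0)))

rng = np.random.default_rng(0)
F = rng.standard_normal((4, 9))
F = F / np.linalg.norm(F, axis=0, keepdims=True)   # unit norm columns
```

The second function only requires the Gram matrix, which is the sense in which the quantity is "easily computable"; both agree for unit norm `F`.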
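One way to get a ballpark estimate on what these parameters $\delta$, $C$ and $\alpha$ should be, under the best possible
circumstances, is to solve a weaker problem: given a unit norm frame $F$, find $\tilde{F}$ such that $\tilde{F}\tilde{F}^* = \frac{N}{M} I$ and such that
$\|\tilde{F} - F\|_{\mathrm{HS}}$ is minimized; here, we do not require that $\tilde{F}$ be unit norm. Similar problems have been extensively studied
in the past; see [2] for references. In brief, we have that for any such $F$ and $\tilde{F}$, $\|\tilde{F} - F\|_{\mathrm{HS}}^2 = 2N - 2\mathrm{Re}\,\mathrm{Tr}(F^*\tilde{F})$.
Taking the singular value decomposition $F = U\Sigma V$ and letting $\tilde{\Sigma} = U^*\tilde{F}V^*$ so that $\tilde{F} = U\tilde{\Sigma}V$, we are therefore
seeking to maximize $\mathrm{Re}\,\mathrm{Tr}(F^*\tilde{F}) = \mathrm{Re}\,\mathrm{Tr}(\Sigma^*\tilde{\Sigma})$ subject to the restriction that $\tilde{\Sigma}\tilde{\Sigma}^* = \frac{N}{M} I$. As $\Sigma$ is "diagonal," this
maximum is achieved by letting $\tilde{\Sigma}$ also be "diagonal" with entries $(\frac{N}{M})^{1/2}$, implying

$$\|\tilde{F} - F\|_{\mathrm{HS}}^2 = 2N - 2\mathrm{Re}\,\mathrm{Tr}(\Sigma^*\tilde{\Sigma}) \ge 2N - 2\left(\tfrac{N}{M}\right)^{1/2} \sum_{m=1}^M \lambda_m^{1/2} = \sum_{m=1}^M \left[\lambda_m^{1/2} - \left(\tfrac{N}{M}\right)^{1/2}\right]^2.$$

Multiplying the terms in these summands by their conjugates $\lambda_m^{1/2} + (\frac{N}{M})^{1/2}$ then yields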

M (X - — } 2 M 
"=1 + (|?) 5 J '"=! 

To summarize, the UNTF F which is closest to F necessarily satisfies \\F - F ||hs > (^) J ||FF* - ^IIIhs- As such, in 
our version of the Paulsen problem (3), the best $\alpha$ we should expect is $\alpha = 1$. Indeed, in the case where $M$ and $N$
are relatively prime, we show that $\alpha = 1$ is achievable, provided $\delta$ and $C$ are suitably chosen. Meanwhile, when $M$
and N have a common divisor, a simple example, given in Section 4, shows that the best one can expect is a — |. 
As we shall see, the key issue with the non-relatively-prime case is that there exist UNTFs which can be partitioned 
into mutually orthogonal subcollections; at such frames, the geometric structure of the set of surrounding UNTFs is 
extremely complicated ifioll . 
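The weaker problem discussed in this section (finding the closest tight frame with frame constant $N/M$, without the unit norm requirement) has a closed-form solution through the singular value decomposition. The following is a minimal sketch under that reading; the name `nearest_tight_frame` and the random test frame are assumptions made for illustration.

```python
import numpy as np

def nearest_tight_frame(F):
    """Closest matrix T (in Hilbert-Schmidt norm) with T T^* = (N/M) I.

    With the thin SVD F = U diag(s) Vh, every singular value is replaced
    by (N/M)^(1/2); the resulting T is tight but generally not unit norm.
    """
    M, N = F.shape
    U, _, Vh = np.linalg.svd(F, full_matrices=False)
    return np.sqrt(N / M) * (U @ Vh)

rng = np.random.default_rng(0)
F = rng.standard_normal((3, 6))
F = F / np.linalg.norm(F, axis=0, keepdims=True)
T = nearest_tight_frame(F)
squared_distance = np.linalg.norm(T - F, "fro") ** 2
```

The value `squared_distance` equals $\sum_m [\lambda_m^{1/2} - (N/M)^{1/2}]^2$, where the $\lambda_m$ are the eigenvalues of $FF^*$, matching the display in the text.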
2.1. The gradient of the frame potential

Now that we have formally defined the distance from tightness of a unit norm frame $F$ to be $\|FF^* - \frac{N}{M} I\|_{\mathrm{HS}}$, and
having further posed the problem we are trying to solve with (3), we turn to our specific approach: a gradient descent
of the squared distance from tightness, which, since $\|FF^* - \frac{N}{M} I\|_{\mathrm{HS}}^2 = \mathrm{FP}(F) - \frac{N^2}{M}$, reduces to a gradient descent of the
frame potential. Here, as the domain of optimization $\mathbb{S}_M^N$ is a product of spheres as opposed to the entire space $\mathbb{H}_M^N$,
this version of gradient descent differs from the one most commonly used. In particular, given $F = \{f_n\}_{n=1}^N \in \mathbb{S}_M^N$ and
$G = \{g_n\}_{n=1}^N$ in $\oplus_{n=1}^N f_n^\perp := \{\{g_n\}_{n=1}^N \in \mathbb{H}_M^N : \langle f_n, g_n \rangle = 0, \ \forall n\}$, we use Lemma 2 of [3] along with Taylor's theorem to
estimate the change in frame potential as each $f_n$ is pushed along a great circle with tangent velocity $g_n$:

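Proposition 1. For any $F = \{f_n\}_{n=1}^N \in \mathbb{S}_M^N$ and $G = \{g_n\}_{n=1}^N \in \oplus_{n=1}^N f_n^\perp$, let $f_n(t) := \cos(\|g_n\|t) f_n - \sin(\|g_n\|t) \frac{g_n}{\|g_n\|}$
whenever $g_n \ne 0$, and let $f_n(t) := f_n$ otherwise. Then, $F(t) = \{f_n(t)\}_{n=1}^N \in \mathbb{S}_M^N$ for any $t \in \mathbb{R}$ and satisfies

$$\|F(t) - F\|_{\mathrm{HS}}^2 \le t^2 \sum_{n=1}^N \|g_n\|^2, \tag{4}$$

$$\mathrm{FP}(F(t)) \le \mathrm{FP}(F) - 4t\,\mathrm{Re} \sum_{n=1}^N \langle FF^* f_n, g_n \rangle + 8Nt^2 \sum_{n=1}^N \|g_n\|^2. \tag{5}$$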

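Proof. It is straightforward to show that $\|f_n(t)\| = 1$ for all $n = 1, \dots, N$ and all $t \in \mathbb{R}$. To show (4), note that for any $n$
such that $g_n \ne 0$, we have

$$\|f_n(t) - f_n\|^2 = (\cos(\|g_n\|t) - 1)^2 + \sin^2(\|g_n\|t) = 4\sin^2(\|g_n\|t/2) \le \|g_n\|^2 t^2. \tag{6}$$

As (6) also immediately holds for any $n$ such that $g_n = 0$, we may sum (6) over all $n$ to conclude (4). To prove (5), we
apply Taylor's theorem to $\varphi(t) = \mathrm{FP}(F(t))$ at $t = 0$:

$$\varphi(t) \le \varphi(0) + t\dot{\varphi}(0) + \tfrac{1}{2} t^2 \max_s |\ddot{\varphi}(s)|. \tag{7}$$

To compute the terms in (7), note that $\dot{f}_n(t) = -\|g_n\| \sin(\|g_n\|t) f_n - \cos(\|g_n\|t) g_n$ for any $n$ such that $g_n \ne 0$, a fact that
also holds trivially when $g_n = 0$, since $f_n(t)$ is constant. In particular, $\dot{f}_n(0) = -g_n$ for all $n = 1, \dots, N$. The expression
for $\dot{\varphi}(t)$ given in Lemma 2 of [3] then gives

$$\dot{\varphi}(0) = 4\mathrm{Re}\,\mathrm{Tr}(\dot{F}^*(0) F(0) F^*(0) F(0)) = 4\mathrm{Re}\,\mathrm{Tr}(-G^*FF^*F) = -4\mathrm{Re} \sum_{n=1}^N \langle G^*FF^*F e_n, e_n \rangle = -4\mathrm{Re} \sum_{n=1}^N \langle FF^* f_n, g_n \rangle, \tag{8}$$

where $\{e_n\}_{n=1}^N$ is the standard basis of $\mathbb{H}_N$. Next, as $\ddot{f}_n(t) = -\|g_n\|^2 f_n(t)$ for any $n$, we further have

$$\mathrm{Tr}(\ddot{F}^*(t) F(t) F^*(t) F(t)) = \sum_{n=1}^N \langle \ddot{F}^*(t) F(t) F^*(t) F(t) e_n, e_n \rangle = \sum_{n=1}^N \langle F^*(t) f_n(t), F^*(t) \ddot{f}_n(t) \rangle = -\sum_{n=1}^N \|g_n\|^2 \|F^*(t) f_n(t)\|^2. \tag{9}$$

Substituting (9) into the expression for $\ddot{\varphi}(t)$ given in Lemma 2 of [3] yields

$$\ddot{\varphi}(t) = -4 \sum_{n=1}^N \|g_n\|^2 \|F^*(t) f_n(t)\|^2 + 4\|\dot{F}^*(t) F(t)\|_{\mathrm{HS}}^2 + 2\|\dot{F}(t) F^*(t) + F(t) \dot{F}^*(t)\|_{\mathrm{HS}}^2. \tag{10}$$

To bound (10), note that $\|F(t)\|_{\mathrm{HS}}^2 = \sum_{n=1}^N \|f_n(t)\|^2 = N$ and $\|\dot{F}(t)\|_{\mathrm{HS}}^2 = \sum_{n=1}^N \|\dot{f}_n(t)\|^2 = \sum_{n=1}^N \|g_n\|^2$, and thus

$$\begin{aligned}
|\ddot{\varphi}(t)| &\le 4 \sum_{n=1}^N \|g_n\|^2 \|F^*(t) f_n(t)\|^2 + 4\|\dot{F}^*(t) F(t)\|_{\mathrm{HS}}^2 + 2\|\dot{F}(t) F^*(t) + F(t) \dot{F}^*(t)\|_{\mathrm{HS}}^2 \\
&\le 4 \sum_{n=1}^N \|g_n\|^2 \|F(t)\|_{\mathrm{HS}}^2 \|f_n(t)\|^2 + 4\|\dot{F}(t)\|_{\mathrm{HS}}^2 \|F(t)\|_{\mathrm{HS}}^2 + 2\left(\|\dot{F}(t) F^*(t)\|_{\mathrm{HS}} + \|F(t) \dot{F}^*(t)\|_{\mathrm{HS}}\right)^2 \\
&\le 4 \sum_{n=1}^N \|g_n\|^2 \|F(t)\|_{\mathrm{HS}}^2 + 12 \|\dot{F}(t)\|_{\mathrm{HS}}^2 \|F(t)\|_{\mathrm{HS}}^2 \\
&= 16N \sum_{n=1}^N \|g_n\|^2.
\end{aligned} \tag{11}$$

Substituting (8) and (11) into (7) yields (5). □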

Considering the Taylor expansion of $\mathrm{FP}(F(t))$ given in (5), one might expect the gradient of FP over $\mathbb{S}_M^N$, namely
the choice of vectors $\{g_n\}_{n=1}^N$, modulo positive scalar multiples, which maximizes the linear term $\mathrm{Re} \sum_{n=1}^N \langle FF^* f_n, g_n \rangle$,
to be given by $g_n = FF^* f_n$ for all $n = 1, \dots, N$. Indeed, one may show that this would be the correct gradient if we
regarded the frame potential as a functional over the entire space $\mathbb{H}_M^N$. However, since we are optimizing over $\mathbb{S}_M^N$, we
require that $\{g_n\}_{n=1}^N \in \oplus_{n=1}^N f_n^\perp$. Therefore, we instead take $\{g_n\}_{n=1}^N$ to be the projection of $\{FF^* f_n\}_{n=1}^N$ onto $\oplus_{n=1}^N f_n^\perp$. In
the next result, we formally verify that such a choice is optimal.

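Theorem 2. Pick $F = \{f_n\}_{n=1}^N \in \mathbb{S}_M^N$ and let $P_n$ denote the orthogonal projection from $\mathbb{H}_M$ onto the orthogonal
complement of $f_n$. Then, the minimizer of the bound in (5) over all $t \in \mathbb{R}$ and $\{g_n\}_{n=1}^N \in \oplus_{n=1}^N f_n^\perp$ is given by $t = \frac{1}{4N}$ and

$$g_n = P_n FF^* f_n = FF^* f_n - \langle FF^* f_n, f_n \rangle f_n, \qquad n = 1, \dots, N. \tag{12}$$

Moreover, for any $t \in \mathbb{R}$, this choice for $\{g_n\}_{n=1}^N$ gives

$$\|F(t) - F\|_{\mathrm{HS}}^2 \le t^2 \sum_{n=1}^N \|P_n FF^* f_n\|^2, \tag{13}$$

$$\mathrm{FP}(F(t)) \le \mathrm{FP}(F) - 4t(1 - 2Nt) \sum_{n=1}^N \|P_n FF^* f_n\|^2. \tag{14}$$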



Proof. We seek to minimize

$$-4t\,\mathrm{Re} \sum_{n=1}^N \langle FF^* f_n, g_n \rangle + 8Nt^2 \sum_{n=1}^N \|g_n\|^2 = \frac{2}{N} \sum_{n=1}^N \mathrm{Re} \langle -FF^* f_n + 2Nt g_n, 2Nt g_n \rangle \tag{15}$$

over all $\{g_n\}_{n=1}^N \in \oplus_{n=1}^N f_n^\perp$ and all $t \in \mathbb{R}$. We note immediately from (15) that the optimal $\{g_n\}_{n=1}^N$ and $t$ are not unique,
though we now show that their product is. Indeed, we have $P_n g_n = g_n$, and therefore

$$\begin{aligned}
\mathrm{Re} \langle -FF^* f_n + 2Nt g_n, 2Nt g_n \rangle &= \mathrm{Re} \langle -FF^* f_n + 2Nt g_n, 2Nt P_n g_n \rangle \\
&= \mathrm{Re} \langle -P_n FF^* f_n + 2Nt g_n, 2Nt g_n \rangle \\
&= \tfrac{1}{4} \left( \|-P_n FF^* f_n + 4Nt g_n\|^2 - \|-P_n FF^* f_n\|^2 \right) \\
&\ge -\tfrac{1}{4} \|P_n FF^* f_n\|^2,
\end{aligned}$$

with equality if and only if $-P_n FF^* f_n + 4Nt g_n = 0$. Thus, to minimize (15), and consequently to minimize the upper
bound in (5), we may take $t = \frac{1}{4N}$ and $g_n = P_n FF^* f_n$, as claimed. Moreover, substituting these choices of $g_n$'s into (4)
and (5) yields (13) and (14), respectively. □

Note that for any $t \in (0, \frac{1}{2N})$, Theorem 2 prescribes a direction and step size to travel from a given $F \in \mathbb{S}_M^N$ which
guarantees a predictable decrease in frame potential. Throughout the remainder of this paper, we fix any such $t$ and
repeatedly apply Theorem 2 to produce a sequence of iterations which, in many cases, is guaranteed to converge to
a UNTF. One may also consider what happens to this sequence of iterations as $t$ is taken ever smaller; as $t \to 0$, we
expect to approach a solution to the system of nonlinear ordinary differential equations:

$$\dot{f}_n(s) = -\big( F(s) F^*(s) f_n(s) - \langle F(s) F^*(s) f_n(s), f_n(s) \rangle f_n(s) \big), \qquad \forall n = 1, \dots, N,$$

a matter we leave for future research.
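A single step of the descent prescribed by Theorem 2 can be sketched in a few lines of NumPy. This is an illustrative sketch rather than the authors' implementation: the function names, the random starting frame, and the use of the step size $t = \frac{1}{4N}$ from the theorem are choices made for the example.

```python
import numpy as np

def frame_potential(F):
    """FP(F) = ||F^* F||_HS^2."""
    return float(np.sum(np.abs(F.conj().T @ F) ** 2))

def frame_potential_gradient(F):
    """Columns g_n = F F^* f_n - <F F^* f_n, f_n> f_n, as in (12)."""
    SF = F @ (F.conj().T @ F)                      # columns F F^* f_n
    coeffs = np.real(np.sum(F.conj() * SF, axis=0))  # <F F^* f_n, f_n>
    return SF - F * coeffs

def descent_step(F, t):
    """Push each f_n along its great circle, as in Proposition 1."""
    G = frame_potential_gradient(F)
    norms = np.linalg.norm(G, axis=0)
    safe = np.where(norms > 0, norms, 1.0)   # columns with g_n = 0 stay put
    return np.cos(norms * t) * F - np.sin(norms * t) * (G / safe)

rng = np.random.default_rng(0)
F = rng.standard_normal((3, 7))
F = F / np.linalg.norm(F, axis=0, keepdims=True)
t = 1.0 / (4 * F.shape[1])
F_next = descent_step(F, t)
```

Each column of `F_next` remains on the unit sphere, and the drop in frame potential is at least the amount guaranteed by (14).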

2.2. The preservation of group structure 

Many popular examples of unit norm frames, such as oversampled filter banks and Gabor frames, have a group
structure. In particular, such frames are the orbit $\{U_i f_j\}_{i \in \mathcal{I}, j \in \mathcal{J}}$ of a collection of unit vectors $\{f_j\}_{j \in \mathcal{J}}$ under the action of
a collection of unitary operators $\{U_i\}_{i \in \mathcal{I}}$. While such frames inherently consist of unit norm vectors, it can be difficult
to ensure their tightness. As such, it would be valuable to have a technique which increases the tightness of
such frames without sacrificing their group structure. The next result shows that the technique of Theorem 2 does
precisely this, provided the unitary operators are known to commute with the frame operator.

Proposition 3. Let the orbit $F = \{f_{i,j}\}_{i \in \mathcal{I}, j \in \mathcal{J}} = \{U_i f_j\}_{i \in \mathcal{I}, j \in \mathcal{J}}$ of unit vectors have the property that every unitary matrix
$U_i$ commutes with its frame operator $FF^*$. Then, pushing these vectors along the tangent directions $\{g_{i,j}\}_{i \in \mathcal{I}, j \in \mathcal{J}}$ given
in (12) produces new collections of vectors which possess this same group structure: $F(t) = \{U_i f_j(t)\}_{i \in \mathcal{I}, j \in \mathcal{J}}$.

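Proof. We have $f_{i,j}(t) = \cos(\|g_{i,j}\|t) f_{i,j} - \sin(\|g_{i,j}\|t) \frac{g_{i,j}}{\|g_{i,j}\|}$, where $g_{i,j} := P_{i,j} FF^* f_{i,j}$. That is,

$$g_{i,j} = FF^* U_i f_j - \langle FF^* U_i f_j, U_i f_j \rangle U_i f_j = U_i FF^* f_j - \langle U_i FF^* f_j, U_i f_j \rangle U_i f_j = U_i \big( FF^* f_j - \langle FF^* f_j, f_j \rangle f_j \big) = U_i g_j,$$

where $g_j := FF^* f_j - \langle FF^* f_j, f_j \rangle f_j$. We thus have that $f_{i,j}(t) = U_i f_j(t)$, as claimed:

$$f_{i,j}(t) = \cos(\|U_i g_j\|t) U_i f_j - \sin(\|U_i g_j\|t) \frac{U_i g_j}{\|U_i g_j\|} = U_i \Big( \cos(\|g_j\|t) f_j - \sin(\|g_j\|t) \frac{g_j}{\|g_j\|} \Big) = U_i f_j(t). \qquad □$$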

For example, consider the space of discrete $M$-periodic signals $\ell(\mathbb{Z}_M) = \{f : \mathbb{Z} \to \mathbb{C} : f(m + M) = f(m), \ \forall m\}$.
Letting $M = AC$, the synthesis filter bank associated with some unit norm vectors $\{f_j\}_{j \in \mathcal{J}}$ is $\{T^{Ai} f_j\}_{i=0, j \in \mathcal{J}}^{C-1}$, where $T$
is the translation operator $(Tf)(m) := f(m - 1)$. As one may verify that $FF^* T^{Ai} = T^{Ai} FF^*$, Proposition 3 guarantees
that evolving the $f_j$'s according to Theorem 2 preserves this filter bank structure. Letting $M = BD$, one can
further consider the Gabor subclass of filter bank frames: the Gabor system associated with some unit norm $f$ is
$\{T^{Ai} E^{Bj} f\}_{i=0, j=0}^{C-1, D-1}$, where $E$ is the modulation operator $(Ef)(m) = e^{2\pi \mathrm{i} m / M} f(m)$. Though the operators $E$ and $T$ do not
commute, we nevertheless have that $ET = e^{2\pi \mathrm{i} / M} TE$, a fact which suffices to guarantee that $FF^* T^{Ai} E^{Bj} = T^{Ai} E^{Bj} FF^*$,
and so Proposition 3 guarantees that the method of Theorem 2 preserves the Gabor structure. In particular, one need
only evolve $f$ itself, rather than the entirety of its modulates and translates. That is, one need only compute

$$FF^* f = \sum_{i=0}^{C-1} \sum_{j=0}^{D-1} \langle f, T^{Ai} E^{Bj} f \rangle T^{Ai} E^{Bj} f$$

and consider $f(t) = \cos(\|g\|t) f - \sin(\|g\|t) \frac{g}{\|g\|}$, where $g = FF^* f - \langle FF^* f, f \rangle f$ and $t \in (0, \frac{1}{2N})$. By iteratively applying
this procedure, one produces Gabor frames of ever-increasing tightness.
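The filter bank case of Proposition 3 can be checked numerically. In the sketch below, the names `orbit` and `step_vectors`, the sizes $M = 6$, $A = 2$, and the two random generators are assumptions made for illustration; translation by $A$ is implemented as a cyclic shift with `np.roll`.

```python
import numpy as np

def orbit(generators, A):
    """Columns T^{Ai} f_j for i = 0, ..., M/A - 1, T the cyclic shift by one."""
    M = generators.shape[0]
    shifts = [np.roll(generators, A * i, axis=0) for i in range(M // A)]
    return np.concatenate(shifts, axis=1)

def step_vectors(V, S, t):
    """Move each column v of V along g = S v - <S v, v> v, S the frame operator."""
    SV = S @ V
    G = SV - V * np.real(np.sum(V.conj() * SV, axis=0))
    norms = np.linalg.norm(G, axis=0)
    safe = np.where(norms > 0, norms, 1.0)
    return np.cos(norms * t) * V - np.sin(norms * t) * (G / safe)

rng = np.random.default_rng(0)
M, A = 6, 2
gens = rng.standard_normal((M, 2))
gens = gens / np.linalg.norm(gens, axis=0, keepdims=True)

F = orbit(gens, A)                 # 6 x 6 filter bank frame
S = F @ F.conj().T                 # its frame operator
t = 1.0 / (4 * F.shape[1])

whole = step_vectors(F, S, t)                      # evolve every vector
structured = orbit(step_vectors(gens, S, t), A)    # evolve generators only
```

Evolving only the generators and then re-forming the orbit gives the same frame as evolving all of the vectors, which is the content of the proposition in this setting.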

3. Sufficient conditions for linear convergence of gradient descent 

We now take a given unit norm sequence $F_0 := F = \{f_n\}_{n=1}^N$, and iteratively apply the main result of the previous
section, Theorem 2, to produce a sequence $\{F_k\}_{k=1}^\infty$ of unit norm sequences of increasing tightness. To be clear,
fixing any $t \in (0, \frac{1}{2N})$, and given any unit norm sequence $F_k = \{f_n^{(k)}\}_{n=1}^N$, we first compute $G_k = \{g_n^{(k)}\}_{n=1}^N$:

$$g_n^{(k)} = P_n^{(k)} F_k F_k^* f_n^{(k)} = F_k F_k^* f_n^{(k)} - \langle F_k F_k^* f_n^{(k)}, f_n^{(k)} \rangle f_n^{(k)}, \qquad n = 1, \dots, N. \tag{16}$$

We then define $F_{k+1} = \{f_n^{(k+1)}\}_{n=1}^N$ as follows:

$$f_n^{(k+1)} := \begin{cases} \cos(\|g_n^{(k)}\|t) f_n^{(k)} - \sin(\|g_n^{(k)}\|t) \dfrac{g_n^{(k)}}{\|g_n^{(k)}\|}, & g_n^{(k)} \ne 0, \\[2mm] f_n^{(k)}, & g_n^{(k)} = 0. \end{cases} \tag{17}$$

While Theorem 2 guarantees that the values of $\|F_k F_k^* - \frac{N}{M} I\|_{\mathrm{HS}}$ are decreasing, it does not guarantee that this decrease
is strict, nor that it decreases to zero in the limit, nor that the $F_k$'s themselves converge. Indeed, gradient descent of
the frame potential does not necessarily converge to a UNTF: despite the fact that every local minimizer of the frame
potential is also a global minimizer, there do exist suboptimal critical frames $F$ at which the gradient $G$ vanishes [1].
In this section, we provide conditions which suffice to avoid such nonoptimal critical frames, and moreover, guarantee
that the iterative application of (16) and (17) produces a sequence of unit norm frames which indeed converges to a
UNTF $F_\infty = \lim_k F_k$ that is close to $F = F_0$. To do this, note that a unit norm sequence $F$ is critical with respect
to the frame potential if and only if its gradient $G$ vanishes, which occurs precisely when each $f_n$ is an eigenvector
of the frame operator $FF^*$. As noted in [1], this occurs precisely when $F$ can be partitioned into a collection of
subsequences, each of which is a unit norm tight frame for its span. Here, the key is to recognize that in this setting,
such orthogonality is actually one's enemy. To be precise, we make the following definition:

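Definition 4. A sequence $\{f_n\}_{n=1}^N \in \mathbb{S}_M^N$ is termed orthogonally partitionable (OP) if there exists a nontrivial partition
$\mathcal{I} \sqcup \mathcal{J} = \{1, \dots, N\}$ such that $\langle f_i, f_j \rangle = 0$ for every $i \in \mathcal{I}$, $j \in \mathcal{J}$. More generally, it is $\varepsilon$-orthogonally partitionable
($\varepsilon$-OP) if there exists a nontrivial partition $\mathcal{I} \sqcup \mathcal{J} = \{1, \dots, N\}$ such that $|\langle f_i, f_j \rangle| \le \varepsilon$ for every $i \in \mathcal{I}$, $j \in \mathcal{J}$.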

Thus, one way to ensure $G \ne 0$ is to have that $F$ is not OP. Indeed, as we show in the following result, if $F$ is not
$\varepsilon$-OP, then the amount $F$'s frame potential decreases in one iteration of gradient descent, as given in Theorem 2, is at
least some fixed percentage of $F$'s distance from tightness.
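Whether a given frame is $\varepsilon$-OP can be decided without enumerating partitions. Reading Definition 4 with the non-strict inequality $|\langle f_i, f_j \rangle| \le \varepsilon$ (an assumption about the garbled source), a frame is $\varepsilon$-OP exactly when the graph joining $i$ and $j$ whenever $|\langle f_i, f_j \rangle| > \varepsilon$ is disconnected. The function name below is hypothetical and the sketch is only illustrative.

```python
import numpy as np

def is_eps_orthogonally_partitionable(F, eps):
    """True if the columns of F split into two nonempty groups whose
    cross inner products all have modulus at most eps."""
    N = F.shape[1]
    strong = np.abs(F.conj().T @ F) > eps   # edges of the "strong" graph
    reached, stack = {0}, [0]
    while stack:                            # depth-first search from column 0
        i = stack.pop()
        for j in range(N):
            if strong[i, j] and j not in reached:
                reached.add(j)
                stack.append(j)
    return len(reached) < N                 # disconnected graph => eps-OP

s = 1.0 / np.sqrt(2.0)
F = np.array([[1.0, 0.0, s],
              [0.0, 1.0, s]])
```

For the example `F`, the third vector has inner product of modulus about 0.707 with each of the first two, so the frame is not 0.5-OP but is 0.8-OP.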

Theorem 5. Let s e (0, j, and take F e §^ satisfying \\FF* — |^I||hs ^ 2Jr Let P n denote the orthogonal projection 
from Mm onto the orthogonal complement of f„. IfF is not E-orthogonally partitionable, then 

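$$\frac{\varepsilon^2}{4M^4} \left\|FF^* - \tfrac{N}{M} I\right\|_{\mathrm{HS}}^2 \le \sum_{n=1}^N \|P_n FF^* f_n\|^2 \le 4N \left\|FF^* - \tfrac{N}{M} I\right\|_{\mathrm{HS}}^2. \tag{18}$$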

Proof. Let $\{\lambda_m\}_{m=1}^M$ denote the eigenvalues of $FF^*$, arranged in increasing order, with corresponding orthonormal
eigenbasis $\{e_m\}_{m=1}^M$. Decomposing any $f_n$ in terms of this eigenbasis gives

$$\gamma_n := \langle FF^* f_n, f_n \rangle = \Big\langle FF^* \sum_{m=1}^M \langle f_n, e_m \rangle e_m, f_n \Big\rangle = \sum_{m=1}^M \lambda_m |\langle f_n, e_m \rangle|^2.$$

That is, each $\gamma_n$ is a convex combination of $FF^*$'s spectrum. Since, as noted previously, $\frac{N}{M}$ is the average of the $\lambda_m$'s,
we therefore have $\gamma_n, \frac{N}{M} \in [\lambda_1, \lambda_M]$, and so for any $m$ and $n$,

$$(\lambda_m - \gamma_n)^2 \le (\lambda_M - \lambda_1)^2 \le 4 \max_{m'} \left(\lambda_{m'} - \tfrac{N}{M}\right)^2 \le 4 \sum_{m'=1}^M \left(\lambda_{m'} - \tfrac{N}{M}\right)^2 = 4 \left\|FF^* - \tfrac{N}{M} I\right\|_{\mathrm{HS}}^2. \tag{19}$$

Also, by the definitions of $P_n$ and $\gamma_n$, we have $\sum_{n=1}^N \|P_n FF^* f_n\|^2 = \sum_{n=1}^N \|(FF^* - \gamma_n I) f_n\|^2$. Decomposing each $f_n$ in
terms of the $e_m$'s therefore gives

$$\sum_{n=1}^N \|P_n FF^* f_n\|^2 = \sum_{n=1}^N \Big\| (FF^* - \gamma_n I) \sum_{m=1}^M \langle f_n, e_m \rangle e_m \Big\|^2 = \sum_{n=1}^N \Big\| \sum_{m=1}^M (\lambda_m - \gamma_n) \langle f_n, e_m \rangle e_m \Big\|^2 = \sum_{n=1}^N \sum_{m=1}^M (\lambda_m - \gamma_n)^2 |\langle f_n, e_m \rangle|^2. \tag{20}$$

From here, we apply (19) to get the right-hand inequality of (18):

$$\sum_{n=1}^N \|P_n FF^* f_n\|^2 \le 4 \left\|FF^* - \tfrac{N}{M} I\right\|_{\mathrm{HS}}^2 \sum_{n=1}^N \sum_{m=1}^M |\langle f_n, e_m \rangle|^2 = 4N \left\|FF^* - \tfrac{N}{M} I\right\|_{\mathrm{HS}}^2.$$



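Note that this inequality holds in general, that is, for any $F \in \mathbb{S}_M^N$. We now seek the left-hand inequality of (18). Since
the largest gap between successive eigenvalues is no smaller than the average gap, there necessarily exists an $m_0$ that
satisfies

$$\lambda_{m_0+1} - \lambda_{m_0} \ge \tfrac{1}{M-1} (\lambda_M - \lambda_1) \ge \tfrac{1}{M} (\lambda_M - \lambda_1). \tag{21}$$

Define $\mathcal{I} := \{n : \gamma_n < \frac{1}{2}(\lambda_{m_0} + \lambda_{m_0+1})\}$, $\mathcal{J} := \{1, \dots, N\} \setminus \mathcal{I}$. This partitions the $\gamma_n$'s according to where they lie in
relation to the midpoint $\frac{1}{2}(\lambda_{m_0} + \lambda_{m_0+1})$ of the largest gap between eigenvalues. Therefore, the $\lambda_m$'s lying above this
midpoint are at least half the gap away, namely at least $\frac{1}{2}(\lambda_{m_0+1} - \lambda_{m_0}) \ge \frac{1}{2M}(\lambda_M - \lambda_1)$ away, from the $\gamma_n$'s lying below
the midpoint, and vice versa. In fact, when $m \ge m_0 + 1$ and $n \in \mathcal{I}$, or when $m \le m_0$ and $n \in \mathcal{J}$, we have

$$(\lambda_m - \gamma_n)^2 \ge \left[\tfrac{1}{2M} (\lambda_M - \lambda_1)\right]^2 \ge \tfrac{1}{4M^2} \max_{m'} \left(\lambda_{m'} - \tfrac{N}{M}\right)^2 \ge \tfrac{1}{4M^3} \sum_{m'=1}^M \left(\lambda_{m'} - \tfrac{N}{M}\right)^2 = \tfrac{1}{4M^3} \left\|FF^* - \tfrac{N}{M} I\right\|_{\mathrm{HS}}^2. \tag{22}$$

That said, if $i \in \mathcal{I}$ and $j \in \mathcal{J}$, then regardless of $m$, $\lambda_m$ is on one side of the midpoint $\frac{1}{2}(\lambda_{m_0} + \lambda_{m_0+1})$, and either $\gamma_i$ or
$\gamma_j$ is on the other side, implying

$$\max\left\{ (\lambda_m - \gamma_i)^2, (\lambda_m - \gamma_j)^2 \right\} \ge \tfrac{1}{4M^3} \left\|FF^* - \tfrac{N}{M} I\right\|_{\mathrm{HS}}^2. \tag{23}$$

Now suppose both $\mathcal{I}$ and $\mathcal{J}$ are nonempty. Since $F$ is not $\varepsilon$-OP, there exist $i \in \mathcal{I}$ and $j \in \mathcal{J}$ such that $\varepsilon < |\langle f_i, f_j \rangle|$.
Decomposing over the eigenbasis, we therefore have

$$\varepsilon^2 < |\langle f_i, f_j \rangle|^2 \le \Big( \sum_{m=1}^M |\langle f_i, e_m \rangle| |\langle f_j, e_m \rangle| \Big)^2 \le M \sum_{m=1}^M |\langle f_i, e_m \rangle|^2 |\langle f_j, e_m \rangle|^2 \le M \sum_{m=1}^M \min\left\{ |\langle f_i, e_m \rangle|^2, |\langle f_j, e_m \rangle|^2 \right\}, \tag{24}$$

where the last inequality uses $|\langle f_n, e_m \rangle| \le \|f_n\| \|e_m\| = 1$. Recalling (20), we isolate the $i$th and $j$th terms:

$$\begin{aligned}
\sum_{n=1}^N \|P_n FF^* f_n\|^2 &= \sum_{n=1}^N \sum_{m=1}^M (\lambda_m - \gamma_n)^2 |\langle f_n, e_m \rangle|^2 \\
&\ge \sum_{m=1}^M \left( (\lambda_m - \gamma_i)^2 |\langle f_i, e_m \rangle|^2 + (\lambda_m - \gamma_j)^2 |\langle f_j, e_m \rangle|^2 \right) \\
&\ge \sum_{m=1}^M \max\left\{ (\lambda_m - \gamma_i)^2, (\lambda_m - \gamma_j)^2 \right\} \min\left\{ |\langle f_i, e_m \rangle|^2, |\langle f_j, e_m \rangle|^2 \right\}.
\end{aligned}$$

From here, we apply (23) and (24) to get

$$\sum_{n=1}^N \|P_n FF^* f_n\|^2 \ge \tfrac{1}{4M^3} \left\|FF^* - \tfrac{N}{M} I\right\|_{\mathrm{HS}}^2 \sum_{m=1}^M \min\left\{ |\langle f_i, e_m \rangle|^2, |\langle f_j, e_m \rangle|^2 \right\} \ge \tfrac{\varepsilon^2}{4M^4} \left\|FF^* - \tfrac{N}{M} I\right\|_{\mathrm{HS}}^2.$$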



Therefore, we indeed have the left-hand inequality of (18) in the case where both $\mathcal{I}$ and $\mathcal{J}$ are nonempty. We now
turn to the case where either $\mathcal{I}$ or $\mathcal{J}$ is empty. We have

M 

max(^ - S. f < £ (/ln) - £ ) 2 = ||FF* - £ || 2 S < (^) 2 , (25) 

m=l 

where the last inequality follows from one of our assumptions. Therefore, recalling mo from (ETI) . we have 

AT 

Yj l</». e -o>l 2 = F*e mo || 2 = {FF*e m ,e m ) = A m > Ax > f - max^ - f | > (26) 

«=i 

where the last inequality is by (l25l l. In particular, if I is empty, we recall ( l20l i. isolating its moth term: 

AT A 7 M N 

Yj \\ P « FF *fn\\ 2 = J] £(4. ~ r " )2|</ "' gm>|2 " ~ r » )2K/ '" e '"» )|2 ' (2?) 

n=\ n=\ m=\ n=\ 

Since I — 0, then = {1, . . . , N], and thus (l22b holds for m = mo and all n. Coupled with ( l26l i and ( l27l i. this implies 

A 1 A 7 

H=l »=1 

where the last inequality uses s 2 < 1 < j. This proves the left-hand inequality of ([T8l > in the case where I is empty. 
A similar argument — isolating the (mo + l)st term in (1201 — holds in the remaining case where fj is empty. □ 

The previous result, along with Theorem 2, guarantees a certain decrease in frame potential, provided the given
frame $F$ is not $\varepsilon$-OP. In the next result, we show that if, when performing the gradient descent steps (16) and (17), one
can ensure that each iteration $F_k$ is not $\varepsilon$-OP for some $\varepsilon > 0$ independent of $k$, then gradient descent converges to a
nearby UNTF at a linear rate.

Theorem 6. Fix e e (0, 1] and t e (0, fafo? F = {/i 0) )^ =1 e §^ satisfying ||F F* - f I|| HS < and iterate 
F,t + i := Fk(t) as in d 1 6b one/ ( 1 1 7b . If for any fixed K, we have that F* ;s not e- orthogonally partitionable for all 
k = 0, . . . , K — 1, men me A'm iteration Fk satisfies 

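$$\|F_K - F_0\|_{\mathrm{HS}} \le \frac{4M^4 N^{1/2}}{(1 - 2Nt)\varepsilon^2} \left\|F_0 F_0^* - \tfrac{N}{M} I\right\|_{\mathrm{HS}}, \tag{28}$$

$$\left\|F_K F_K^* - \tfrac{N}{M} I\right\|_{\mathrm{HS}} \le \left(1 - \frac{t(1 - 2Nt)\varepsilon^2}{M^4}\right)^{K/2} \left\|F_0 F_0^* - \tfrac{N}{M} I\right\|_{\mathrm{HS}}. \tag{29}$$

Moreover, if $F_k$ is not $\varepsilon$-orthogonally partitionable for any $k$, then $F_\infty := \lim_k F_k$ exists and is a unit norm tight frame
within (28) from $F_0$.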

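Proof. Define $\gamma := \frac{\varepsilon^2}{4M^4}$ and suppose $F_k$ is not $\varepsilon$-OP for $k = 0, \dots, K - 1$. Then combining (2), (14) and the lower
bound in (18) gives that $F_{k+1} := F_k(t)$ satisfies

$$\begin{aligned}
\left\|F_{k+1} F_{k+1}^* - \tfrac{N}{M} I\right\|_{\mathrm{HS}}^2 &= \mathrm{FP}(F_k(t)) - \tfrac{N^2}{M} \\
&\le \mathrm{FP}(F_k) - \tfrac{N^2}{M} - 4t(1 - 2Nt) \sum_{n=1}^N \|P_n^{(k)} F_k F_k^* f_n^{(k)}\|^2 \\
&\le [1 - 4t(1 - 2Nt)\gamma] \left\|F_k F_k^* - \tfrac{N}{M} I\right\|_{\mathrm{HS}}^2.
\end{aligned}$$

From here, one may proceed inductively to find that

$$\left\|F_k F_k^* - \tfrac{N}{M} I\right\|_{\mathrm{HS}}^2 \le [1 - 4t(1 - 2Nt)\gamma]^k \left\|F_0 F_0^* - \tfrac{N}{M} I\right\|_{\mathrm{HS}}^2, \tag{30}$$

which proves (29), recalling $\gamma := \frac{\varepsilon^2}{4M^4}$. Next, let $\delta := 4N$. To prove (28), we use (13), the upper bound in (18), and
(30) to obtain

$$\|F_{k+1} - F_k\|_{\mathrm{HS}}^2 \le t^2 \sum_{n=1}^N \|P_n^{(k)} F_k F_k^* f_n^{(k)}\|^2 \le t^2 \delta \left\|F_k F_k^* - \tfrac{N}{M} I\right\|_{\mathrm{HS}}^2 \le t^2 \delta [1 - 4t(1 - 2Nt)\gamma]^k \left\|F_0 F_0^* - \tfrac{N}{M} I\right\|_{\mathrm{HS}}^2 \tag{31}$$

for all $k = 0, \dots, K - 1$. In particular, for any $K' \le K$, we can bound $\|F_K - F_{K'}\|_{\mathrm{HS}}$ in terms of a geometric series;
since $t \in (0, \frac{1}{2N})$ and $\gamma = \frac{\varepsilon^2}{4M^4}$ with $\varepsilon \in (0, 1]$, this series is guaranteed to converge:

$$\|F_K - F_{K'}\|_{\mathrm{HS}} \le \sum_{k=K'}^{K-1} \|F_{k+1} - F_k\|_{\mathrm{HS}} \le t \delta^{1/2} \Big( \sum_{k=K'}^{K-1} [1 - 4t(1 - 2Nt)\gamma]^{k/2} \Big) \left\|F_0 F_0^* - \tfrac{N}{M} I\right\|_{\mathrm{HS}}. \tag{32}$$

In particular, letting $K' = 0$ in (32) yields (28):

$$\|F_K - F_0\|_{\mathrm{HS}} \le t \delta^{1/2} \, \frac{1 - [1 - 4t(1 - 2Nt)\gamma]^{K/2}}{1 - [1 - 4t(1 - 2Nt)\gamma]^{1/2}} \left\|F_0 F_0^* - \tfrac{N}{M} I\right\|_{\mathrm{HS}} \le \frac{\delta^{1/2}}{2(1 - 2Nt)\gamma} \left\|F_0 F_0^* - \tfrac{N}{M} I\right\|_{\mathrm{HS}}, \tag{33}$$

where we have used the fact that $(1 - x)^{1/2} \le 1 - \frac{1}{2}x$.

Now suppose $F_k$ is never $\varepsilon$-OP for any $k$, and so (32) holds for all $K' \le K$. In particular, as the series in (32)
vanishes (independently of $K$) as $K'$ grows large, we have that $\{F_k\}_{k=0}^\infty$ is a Cauchy sequence. As $\mathbb{S}_M^N$ is complete,
$F_\infty := \lim_k F_k$ exists. Taking the limit of (30) yields $\|F_\infty F_\infty^* - \frac{N}{M} I\|_{\mathrm{HS}} = 0$, and so $F_\infty$ is a UNTF. Meanwhile, taking
the limit of (33) yields our final conclusion, namely that $F_\infty$ also satisfies (28):

$$\|F_\infty - F_0\|_{\mathrm{HS}} \le \frac{\delta^{1/2}}{2(1 - 2Nt)\gamma} \left\|F_0 F_0^* - \tfrac{N}{M} I\right\|_{\mathrm{HS}} = \frac{4M^4 N^{1/2}}{(1 - 2Nt)\varepsilon^2} \left\|F_0 F_0^* - \tfrac{N}{M} I\right\|_{\mathrm{HS}}. \qquad □$$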

4. Solutions to the Paulsen problem 

In the previous section, we applied gradient descent to $F_0 \in \mathbb{S}_M^N$ to produce a sequence of iterates $\{F_k\}_{k=0}^{\infty}$. We
showed that if $F_0$ is sufficiently tight and if all resulting $F_k$'s are not $\varepsilon$-OP for some fixed $\varepsilon > 0$, then this sequence
converges to a UNTF at a linear rate. In this section, we show that such an $\varepsilon$ always exists, provided $M$ and $N$
are relatively prime. Meanwhile, in the non-relatively-prime case, we give an example that shows such $\varepsilon$'s are not
guaranteed to exist. In this case, our gradient descent algorithm's rate of convergence is threatened whenever our
frame becomes nearly OP; to overcome this threat, we "jump" from our current iterate to a nearby OP frame, and then
continue gradient descent on the individual subframes over their respective subspaces. In so doing, we are able to give
solutions to the Paulsen problem (3) even in the non-relatively-prime case.

4.1. Case I: M and N are relatively prime 

Theorem 6 guarantees that gradient descent converges to a UNTF at a linear rate, provided there is some fixed $\varepsilon > 0$
for which the iterations never become $\varepsilon$-OP. When $M$ and $N$ are relatively prime, this is not a problem:

Theorem 7. Take $F \in \mathbb{S}_M^N$ with $M$ and $N$ relatively prime. If $\|FF^* - \frac{N}{M}I\|_{HS}^2 \leq \frac{2}{M^3}$, then $F$ is not
$(\frac{1}{M^8N^4})$-orthogonally partitionable.

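Proof. We prove the contrapositive: take $F \in \mathbb{S}_M^N$ with $M$ and $N$ relatively prime, and suppose $F$ is $\varepsilon$-OP with
$\varepsilon := \frac{1}{M^8N^4}$; we show that $\|FF^* - \frac{N}{M}I\|_{HS}^2 > \frac{2}{M^3}$. Since $F$ is $\varepsilon$-OP, there exists a nontrivial partition
$\mathcal{I} \sqcup \mathcal{J} = \{1, \ldots, N\}$ such that $|\langle f_i, f_j\rangle| \leq \varepsilon$ for every $i \in \mathcal{I}$, $j \in \mathcal{J}$. Define $F_{\mathcal{I}} := \{f_i\}_{i\in\mathcal{I}}$ and
$F_{\mathcal{J}} := \{f_j\}_{j\in\mathcal{J}}$. The frame operator $F_{\mathcal{I}}F_{\mathcal{I}}^*$ has eigenvalues $\{\lambda_{\mathcal{I},m}\}_{m=1}^{M}$ and eigenvectors $\{e_{\mathcal{I},m}\}_{m=1}^{M}$, and similarly
for $F_{\mathcal{J}}F_{\mathcal{J}}^*$. Without loss of generality, we arrange both sets of eigenvalues in decreasing order. Take
$\lambda' := \frac{1}{M^4N}$, and define $M_{\mathcal{I}} := \#\{m : \lambda_{\mathcal{I},m} \geq \lambda'\}$, and similarly for $M_{\mathcal{J}}$. We know $M_{\mathcal{I}} \geq 1$, since otherwise we have
a contradiction:

$$1 \leq |\mathcal{I}| = \mathrm{Tr}(F_{\mathcal{I}}^*F_{\mathcal{I}}) = \mathrm{Tr}(F_{\mathcal{I}}F_{\mathcal{I}}^*) = \sum_{m=1}^{M}\lambda_{\mathcal{I},m} < M\lambda' = \frac{1}{M^3N} \leq 1.$$

Similarly, $M_{\mathcal{J}} \geq 1$. Moreover, we claim $M_{\mathcal{I}} + M_{\mathcal{J}} \leq M$. Indeed, if not, then
$\mathrm{Span}\{e_{\mathcal{I},m}\}_{m=1}^{M_{\mathcal{I}}} \cap \mathrm{Span}\{e_{\mathcal{J},m}\}_{m=1}^{M_{\mathcal{J}}}$ has positive dimension, and so we may find a unit vector $u$ in this subspace.
Since $e_{\mathcal{I},m}$ is an eigenvector of $F_{\mathcal{I}}F_{\mathcal{I}}^*$ with eigenvalue $\lambda_{\mathcal{I},m}$, we have

$$u = \sum_{m=1}^{M_{\mathcal{I}}}\langle u, e_{\mathcal{I},m}\rangle e_{\mathcal{I},m} = \sum_{m=1}^{M_{\mathcal{I}}}\langle u, e_{\mathcal{I},m}\rangle\frac{1}{\lambda_{\mathcal{I},m}}\sum_{i\in\mathcal{I}}\langle e_{\mathcal{I},m}, f_i\rangle f_i,$$

and we have a similar expression with $\mathcal{J}$. Therefore, we apply the triangle inequality to get

$$\begin{aligned}
1 = |\langle u, u\rangle| &= \Big|\Big\langle \sum_{m=1}^{M_{\mathcal{I}}}\langle u, e_{\mathcal{I},m}\rangle\frac{1}{\lambda_{\mathcal{I},m}}\sum_{i\in\mathcal{I}}\langle e_{\mathcal{I},m}, f_i\rangle f_i,\ \sum_{m'=1}^{M_{\mathcal{J}}}\langle u, e_{\mathcal{J},m'}\rangle\frac{1}{\lambda_{\mathcal{J},m'}}\sum_{j\in\mathcal{J}}\langle e_{\mathcal{J},m'}, f_j\rangle f_j\Big\rangle\Big| \\
&\leq \sum_{i\in\mathcal{I}}\sum_{m=1}^{M_{\mathcal{I}}}\sum_{j\in\mathcal{J}}\sum_{m'=1}^{M_{\mathcal{J}}}\frac{1}{\lambda_{\mathcal{I},m}\lambda_{\mathcal{J},m'}}|\langle u, e_{\mathcal{I},m}\rangle||\langle e_{\mathcal{I},m}, f_i\rangle||\langle f_i, f_j\rangle||\langle u, e_{\mathcal{J},m'}\rangle||\langle e_{\mathcal{J},m'}, f_j\rangle| \\
&\leq \frac{\varepsilon}{(\lambda')^2}\Big(\sum_{i\in\mathcal{I}}\sum_{m=1}^{M_{\mathcal{I}}}|\langle u, e_{\mathcal{I},m}\rangle||\langle e_{\mathcal{I},m}, f_i\rangle|\Big)\Big(\sum_{j\in\mathcal{J}}\sum_{m=1}^{M_{\mathcal{J}}}|\langle u, e_{\mathcal{J},m}\rangle||\langle e_{\mathcal{J},m}, f_j\rangle|\Big),
\end{aligned}$$

where the last inequality comes from $|\langle f_i, f_j\rangle| \leq \varepsilon$ and $\lambda_{\mathcal{I},m}, \lambda_{\mathcal{J},m'} \geq \lambda'$. From here, we use $\frac{\varepsilon}{(\lambda')^2} = \frac{1}{N^2}$ and Hölder's
inequality to get

$$1 \leq \frac{1}{N^2}\sum_{i\in\mathcal{I}}\Big(\sum_{m=1}^{M_{\mathcal{I}}}|\langle u, e_{\mathcal{I},m}\rangle|^2\Big)^{\frac12}\Big(\sum_{m=1}^{M_{\mathcal{I}}}|\langle e_{\mathcal{I},m}, f_i\rangle|^2\Big)^{\frac12}\sum_{j\in\mathcal{J}}\Big(\sum_{m=1}^{M_{\mathcal{J}}}|\langle u, e_{\mathcal{J},m}\rangle|^2\Big)^{\frac12}\Big(\sum_{m=1}^{M_{\mathcal{J}}}|\langle e_{\mathcal{J},m}, f_j\rangle|^2\Big)^{\frac12} \leq \frac{|\mathcal{I}||\mathcal{J}|}{N^2} \leq \frac14,$$

a contradiction. As a partial summary, we know $M_{\mathcal{I}}$ and $M_{\mathcal{J}}$ are nonzero and $M_{\mathcal{I}} + M_{\mathcal{J}} \leq M$. Now,

$$|\mathcal{I}| = \mathrm{Tr}(F_{\mathcal{I}}^*F_{\mathcal{I}}) = \mathrm{Tr}(F_{\mathcal{I}}F_{\mathcal{I}}^*) = \sum_{m=1}^{M}\lambda_{\mathcal{I},m} = \sum_{m=1}^{M_{\mathcal{I}}}\lambda_{\mathcal{I},m} + \sum_{m=M_{\mathcal{I}}+1}^{M}\lambda_{\mathcal{I},m},$$

where $\sum_{m=M_{\mathcal{I}}+1}^{M}\lambda_{\mathcal{I},m} \leq (M - M_{\mathcal{I}})\lambda'$. Therefore, $\sum_{m=1}^{M_{\mathcal{I}}}\lambda_{\mathcal{I},m} \geq |\mathcal{I}| - (M - M_{\mathcal{I}})\lambda'$, and so Jensen's inequality gives

$$\sum_{m=1}^{M_{\mathcal{I}}}\lambda_{\mathcal{I},m}^2 \geq \frac{1}{M_{\mathcal{I}}}\Big(\sum_{m=1}^{M_{\mathcal{I}}}\lambda_{\mathcal{I},m}\Big)^2 \geq \frac{1}{M_{\mathcal{I}}}\big(|\mathcal{I}| - (M - M_{\mathcal{I}})\lambda'\big)^2 \geq \frac{|\mathcal{I}|^2}{M_{\mathcal{I}}} - 2|\mathcal{I}|\frac{M - M_{\mathcal{I}}}{M_{\mathcal{I}}}\lambda', \tag{34}$$

and similarly for $\mathcal{J}$. We now consider the frame potential of $F$:

$$\mathrm{FP}(F) = \mathrm{Tr}[(FF^*)^2] = \mathrm{Tr}[(F_{\mathcal{I}}F_{\mathcal{I}}^* + F_{\mathcal{J}}F_{\mathcal{J}}^*)^2] = \mathrm{Tr}[(F_{\mathcal{I}}F_{\mathcal{I}}^*)^2] + \mathrm{Tr}[(F_{\mathcal{J}}F_{\mathcal{J}}^*)^2] + 2\,\mathrm{Tr}[F_{\mathcal{I}}F_{\mathcal{I}}^*F_{\mathcal{J}}F_{\mathcal{J}}^*].$$

Since $\mathrm{Tr}[F_{\mathcal{I}}F_{\mathcal{I}}^*F_{\mathcal{J}}F_{\mathcal{J}}^*] = \|F_{\mathcal{I}}^*F_{\mathcal{J}}\|_{HS}^2 \geq 0$, we continue:

$$\mathrm{FP}(F) \geq \sum_{m=1}^{M_{\mathcal{I}}}\lambda_{\mathcal{I},m}^2 + \sum_{m=1}^{M_{\mathcal{J}}}\lambda_{\mathcal{J},m}^2 \geq \frac{|\mathcal{I}|^2}{M_{\mathcal{I}}} + \frac{|\mathcal{J}|^2}{M_{\mathcal{J}}} - 2\lambda'\Big[|\mathcal{I}|\frac{M - M_{\mathcal{I}}}{M_{\mathcal{I}}} + |\mathcal{J}|\frac{M - M_{\mathcal{J}}}{M_{\mathcal{J}}}\Big], \tag{35}$$

where the last inequality is by (34). Moreover, considering $M_{\mathcal{I}} + M_{\mathcal{J}} \leq M$, we have

$$\frac{|\mathcal{I}|^2}{M_{\mathcal{I}}} + \frac{|\mathcal{J}|^2}{M_{\mathcal{J}}} \geq \frac{|\mathcal{I}|^2}{M_{\mathcal{I}}} + \frac{(N - |\mathcal{I}|)^2}{M - M_{\mathcal{I}}} = \frac{N^2}{M} + \frac{(|\mathcal{I}|M - M_{\mathcal{I}}N)^2}{MM_{\mathcal{I}}(M - M_{\mathcal{I}})} \geq \frac{N^2}{M} + \frac{4}{M^3}, \tag{36}$$

where the last inequality uses the fact that $M$ and $N$ are relatively prime (that is, $|\mathcal{I}|M - M_{\mathcal{I}}N$ is a nonzero integer)
and $M_{\mathcal{I}}(M - M_{\mathcal{I}}) \leq \frac{M^2}{4}$. Also, since $M_{\mathcal{I}}, M_{\mathcal{J}} \geq 1$, we have

$$|\mathcal{I}|\frac{M - M_{\mathcal{I}}}{M_{\mathcal{I}}} + |\mathcal{J}|\frac{M - M_{\mathcal{J}}}{M_{\mathcal{J}}} \leq (M - 1)(|\mathcal{I}| + |\mathcal{J}|) < MN. \tag{37}$$

Therefore, combining (35), (36) and (37) gives $\mathrm{FP}(F) > \frac{N^2}{M} + \frac{2}{M^3}$, meaning $\|FF^* - \frac{N}{M}I\|_{HS}^2 > \frac{2}{M^3}$. $\Box$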
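
The integer argument behind (36) can be checked by brute force. The sketch below is illustrative only (the function name `min_gap` is hypothetical); it uses exact rational arithmetic to confirm the identity $\frac{a^2}{m} + \frac{(N-a)^2}{M-m} - \frac{N^2}{M} = \frac{(aM - mN)^2}{Mm(M-m)}$ for all admissible sizes $a = |\mathcal{I}|$ and $m = M_{\mathcal{I}}$, and returns the smallest value of this gap.

```python
from fractions import Fraction
from math import gcd

def min_gap(M, N):
    """Smallest value of a^2/m + (N-a)^2/(M-m) - N^2/M over 1 <= a <= N-1, 1 <= m <= M-1."""
    best = None
    for a in range(1, N):            # a plays the role of |I|
        for m in range(1, M):        # m plays the role of M_I
            gap = Fraction(a**2, m) + Fraction((N - a)**2, M - m) - Fraction(N**2, M)
            # Closed form appearing in (36).
            assert gap == Fraction((a * M - m * N)**2, M * m * (M - m))
            best = gap if best is None else min(best, gap)
    return best

# Relatively prime pairs: the gap never drops below 4 / M^3.
for M, N in [(2, 3), (3, 5), (4, 7), (5, 8)]:
    assert gcd(M, N) == 1
    assert min_gap(M, N) >= Fraction(4, M**3)
```

For a pair with a common factor, such as $M = 2$ and $N = 4$, the same function returns zero (take $a = 2$, $m = 1$), which is the obstruction exploited in the next subsection.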

Note that Theorem 7 requires sufficient tightness to guarantee that $F$ is not $(\frac{1}{M^8N^4})$-orthogonally partitionable. Since
gradient descent only decreases the frame potential, Theorem 7 will apply to every subsequent iteration. Therefore,
by Theorem 6, gradient descent converges to a UNTF in the relatively prime case:

Corollary 8. Suppose $M$ and $N$ are relatively prime. Pick $t \in (0, \frac{1}{2N})$, take $F_0 \in \mathbb{S}_M^N$ satisfying $\|F_0F_0^* - \frac{N}{M}I\|_{HS}^2 \leq \frac{2}{M^3}$,
and iterate $F_{k+1} := F_k(t)$ as in (16) and (17). Then, $F_\infty := \lim_{k} F_k$ exists and is a unit norm tight frame satisfying

$$\|F_\infty - F_0\|_{HS} \leq \frac{4M^{20}N^{8.5}}{1 - 2Nt}\,\|F_0F_0^* - \tfrac{N}{M}I\|_{HS}.$$

This solves the Paulsen problem (3) in the case where $M$ and $N$ are relatively prime. To be explicit, taking $t = \frac{1}{4N}$,
we have $\delta = \sqrt{2}\,M^{-\frac32}$, $C = 8M^{20}N^{8.5}$, and $\alpha = 1$. These constants are roughly comparable to those previously given
in [3], which were obtained using independent methods. As noted earlier, $\alpha = 1$ is the best one can hope for in any
case. In the next subsection, we give an example that shows that these techniques fall apart in the case where $M$ and
$N$ share a common divisor, and moreover, that in such cases, we must set our sights lower with respect to $\alpha$.

4.2. Case II: M and N are not relatively prime 

We continue our solution to the Paulsen problem in the remaining case where $M$ and $N$ are not relatively prime.
Let's begin this case with an example in two dimensions:

Example 9. Take some real $F \in \mathbb{S}_2^N$, that is, $F = \{(\cos\theta_n, \sin\theta_n)\}_{n=1}^{N}$ for some collection of $\theta_n$'s. In this case, it is
known [12] that $F$ is tight precisely when the sum of $\{(\cos 2\theta_n, \sin 2\theta_n)\}_{n=1}^{N}$ vanishes. In fact, one can show that

$$\mathrm{FP}(F) - \frac{N^2}{2} = \Big(\sum_{n=1}^{N}\cos^2\theta_n\Big)^2 + 2\Big(\sum_{n=1}^{N}\cos\theta_n\sin\theta_n\Big)^2 + \Big(\sum_{n=1}^{N}\sin^2\theta_n\Big)^2 - \frac{N^2}{2} = \frac12\Big(\sum_{n=1}^{N}\cos 2\theta_n\Big)^2 + \frac12\Big(\sum_{n=1}^{N}\sin 2\theta_n\Big)^2,$$

and so $\|FF^* - \frac{N}{2}I\|_{HS} = \frac{1}{\sqrt2}\big\|\sum_{n=1}^{N}(\cos 2\theta_n, \sin 2\theta_n)\big\|$. That is, given any unit vectors in $\mathbb{R}^2$, double their polar angles, and
add the resulting vectors, base-to-tip; for this chain of vectors, the distance between its head and tail is proportional
to the original vectors' distance from tightness. In particular, our physical intuition tells us that if a collection of unit
vectors is close to being tight, then their double-angle counterparts must only be slightly perturbed in order to close
their chain, meaning the original vectors are indeed close to a UNTF. But how close? To begin to answer this question,
consider the following example:



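$$F(\theta) := \begin{bmatrix} \cos\theta & \cos\theta & 0 & 0 \\ \sin\theta & -\sin\theta & 1 & 1 \end{bmatrix}, \qquad \tilde{F}(\theta) := \begin{bmatrix} \cos\frac{\theta}{2} & \cos\frac{\theta}{2} & -\sin\frac{\theta}{2} & \sin\frac{\theta}{2} \\ \sin\frac{\theta}{2} & -\sin\frac{\theta}{2} & \cos\frac{\theta}{2} & \cos\frac{\theta}{2} \end{bmatrix}. \tag{38}$$

One can show that $\|F(\theta)F^*(\theta) - \frac{N}{M}I\|_{HS}^2 = 8\sin^4\theta$, while $\sum_{n=1}^{4}\|P_n(\theta)F(\theta)F^*(\theta)f_n(\theta)\|^2 = 32\sin^6\theta\cos^2\theta$. That said,
unlike in (18), there is no positive factor $A$ independent of $\theta$ such that $A\|F(\theta)F^*(\theta) - \frac{N}{M}I\|_{HS}^2 \leq \sum_{n=1}^{4}\|P_n(\theta)F(\theta)F^*(\theta)f_n(\theta)\|^2$
for all $\theta$. Therefore, at the very least, our analysis of the gradient descent algorithm, given in the previous section,
must be refined in order to guarantee convergence.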
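
The closed forms in this example are easy to confirm numerically. The sketch below is illustrative rather than part of the original argument; the helper names are hypothetical, and $P_n$ is taken to be the orthogonal projection onto the complement of $f_n$. It checks the double-angle identity for arbitrary planar unit vectors, then the two formulas for the four-vector frame $F(\theta)$, whose columns sit at polar angles $\theta$, $-\theta$, $\frac{\pi}{2}$, $\frac{\pi}{2}$.

```python
import numpy as np

def frame_from_angles(angles):
    """2 x N real frame whose n-th column is (cos(angle_n), sin(angle_n))."""
    a = np.asarray(angles, dtype=float)
    return np.vstack([np.cos(a), np.sin(a)])

def tightness(F):
    """HS distance ||F F^T - (N/M) I|| of a real M x N frame from tightness."""
    M, N = F.shape
    return np.linalg.norm(F @ F.T - (N / M) * np.eye(M), "fro")

def projected_energy(F):
    """Sum over n of ||P_n F F^T f_n||^2, with P_n projecting away from f_n."""
    SF = (F @ F.T) @ F
    G = SF - F * np.sum(SF * F, axis=0)
    return float(np.sum(G**2))

def example_frame(theta):
    """The frame F(theta): columns at polar angles theta, -theta, pi/2, pi/2."""
    return frame_from_angles([theta, -theta, np.pi / 2, np.pi / 2])

# Double-angle identity for a random planar unit norm frame.
rng = np.random.default_rng(1)
angles = rng.uniform(0, 2 * np.pi, size=7)
chain = np.array([np.cos(2 * angles).sum(), np.sin(2 * angles).sum()])
assert np.isclose(tightness(frame_from_angles(angles)), np.linalg.norm(chain) / np.sqrt(2))

# Closed forms for F(theta); the ratio 4 sin^2 cos^2 of the two tends to 0 with theta.
for theta in [0.3, 0.1, 0.01]:
    F = example_frame(theta)
    assert np.isclose(tightness(F) ** 2, 8 * np.sin(theta) ** 4)
    assert np.isclose(projected_energy(F), 32 * np.sin(theta) ** 6 * np.cos(theta) ** 2)
```

The ratio of the two quantities is $4\sin^2\theta\cos^2\theta$, which is why no $\theta$-independent lower constant of the kind used in (18) can exist here.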

Nevertheless, in this example, we can show that gradient descent does, in fact, converge to a UNTF, albeit at a
sublinear rate. Here, $g_1(\theta) = 4\cos\theta\sin^3\theta\,(-\sin\theta, \cos\theta)$, $g_2(\theta) = -4\cos\theta\sin^3\theta\,(\sin\theta, \cos\theta)$, and $g_3(\theta) = g_4(\theta) = 0$.
Using the update (16) and (17), one can show that $F(\theta; t) = F(\theta - 4t\cos\theta\sin^3\theta)$. That is, each iteration transforms an
arrangement of angle $\theta$ into a new arrangement with angle $\theta - 4t\cos\theta\sin^3\theta$; repeated iterations indeed converge to
$\theta = 0$, albeit very slowly. In this way, gradient descent converges to $\{e_1, e_1, e_2, e_2\}$, that is, two copies of the standard
basis, which is indeed a UNTF. Note that since the limiting frame is OP, we know that for each $\varepsilon > 0$, the $F_k$'s
eventually become $\varepsilon$-OP, which is why the linear rate of convergence guaranteed by Theorem 6 does not hold here.
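
The slow convergence can be seen by iterating the scalar map $\theta \mapsto \theta - 4t\cos\theta\sin^3\theta$ directly. This is a small sketch under the assumption $t = \frac{1}{4N} = \frac{1}{16}$ (any $t \in (0, \frac{1}{2N})$ would do); the function name `iterate_angle` is hypothetical.

```python
import math

def iterate_angle(theta0, t, steps):
    """Iterate theta -> theta - 4 t cos(theta) sin(theta)^3 and return every iterate."""
    thetas = [theta0]
    for _ in range(steps):
        th = thetas[-1]
        thetas.append(th - 4 * t * math.cos(th) * math.sin(th) ** 3)
    return thetas

thetas = iterate_angle(theta0=0.5, t=1 / 16, steps=10000)

# The angle decreases monotonically toward 0 but stays positive.
assert all(0 < b < a for a, b in zip(thetas, thetas[1:]))
# Successive ratios approach 1, so there is no fixed contraction factor below 1.
assert thetas[-1] / thetas[-2] > 0.999
```

For small angles the map behaves like $\theta \mapsto \theta - 4t\theta^3$, so $\theta_k$ decays roughly like $(8tk)^{-1/2}$ rather than geometrically.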
This same example can be used to give a baseline on answers to the Paulsen problem in the non-relatively-prime
case. Indeed, noting that every real UNTF in $\mathbb{S}_2^4$ is the union of two orthonormal bases, we can show that for each
$\theta \in [0, \frac{\pi}{4}]$, $\tilde{F}(\theta)$ is the closest UNTF to $F(\theta)$. But, $\|F(\theta) - \tilde{F}(\theta)\|_{HS} = 4\sin\frac{\theta}{4}$, which is on the order of the square-root of
$\|F(\theta)F^*(\theta) - \frac{N}{M}I\|_{HS}$ as $\theta$ grows small. As such, (38) is a counterexample to the sometimes-voiced belief that distance
from a UNTF is at worst a linear function of distance from tightness. In other words, recalling (3), $\alpha = 1$ is not
possible for every $M$ and $N$; even when $M = 2$ and $N = 4$, the best possible $\alpha$ is $\frac12$. This leads to three important
questions: 1) For a given $M$ and $N$, is the version of the Paulsen problem given in (3) even solvable? 2) If so, what is
the best possible $\alpha$ for a given $M$ and $N$? 3) Is there a single $\alpha$ that works for all $M$ and $N$, or does performance truly
depend on the number of common factors between $M$ and $N$? Below, we outline an argument that answers the first
question in the affirmative; the second and third questions remain open.
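
The square-root scaling in this example can be observed numerically. The sketch below assumes the two frames are as in (38), with $F(\theta)$ having columns at polar angles $\theta, -\theta, \frac{\pi}{2}, \frac{\pi}{2}$ and $\tilde{F}(\theta)$ being the union of two orthonormal bases obtained by splitting each angular defect in half. It confirms that $\tilde{F}(\theta)$ is tight, that the distance equals $4\sin\frac{\theta}{4}$, and that the log-log slope of distance against tightness is close to $\frac12$. It does not attempt to verify that $\tilde{F}(\theta)$ is the closest UNTF.

```python
import numpy as np

def F(theta):
    """The frame F(theta) of (38)."""
    c, s = np.cos(theta), np.sin(theta)
    return np.array([[c, c, 0.0, 0.0],
                     [s, -s, 1.0, 1.0]])

def F_tilde(theta):
    """The candidate UNTF of (38), a union of two orthonormal bases."""
    c, s = np.cos(theta / 2), np.sin(theta / 2)
    return np.array([[c, c, -s, s],
                     [s, -s, c, c]])

def tightness(A):
    """HS distance ||A A^T - (N/M) I|| from tightness."""
    M, N = A.shape
    return np.linalg.norm(A @ A.T - (N / M) * np.eye(M), "fro")

def distance(theta):
    """HS distance between F(theta) and F_tilde(theta)."""
    return np.linalg.norm(F(theta) - F_tilde(theta), "fro")

def loglog_slope(theta1, theta2):
    """Slope of log(distance) against log(tightness) between two angles."""
    num = np.log(distance(theta1) / distance(theta2))
    den = np.log(tightness(F(theta1)) / tightness(F(theta2)))
    return num / den
```

For instance, `loglog_slope(1e-2, 1e-3)` evaluates to approximately 0.5, in line with the claim that the distance to the nearby UNTF grows like the square root of the distance from tightness.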

As the preceding example illustrated, gradient descent is not guaranteed by Theorem 6 to converge in the
non-relatively-prime case, since there may be no $\varepsilon$ for which iterations never become $\varepsilon$-OP. To resolve this issue,
we introduce the concept of "jumping" to a nearby OP unit norm frame:

Theorem 10. Let $\varepsilon \in (0, \frac{1}{2M}]$. Then, for every $\varepsilon$-orthogonally partitionable $F \in \mathbb{S}_M^N$, there exists an orthogonally
partitionable $\tilde{F} \in \mathbb{S}_M^N$ such that $\|F - \tilde{F}\|_{HS} \leq (2N)^{\frac12}(M\varepsilon)^{\frac13}$.

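Proof. We first claim that for every unit vector $f \in \mathbb{H}_M$ and every nonzero projection operator $P$ on $\mathbb{H}_M$, there exists
a unit vector $g \in P(\mathbb{H}_M)$ such that $\|f - g\|^2 \leq 2\|(I - P)f\|^2$. If $Pf = 0$, we may take $g$ to be any unit vector in $P(\mathbb{H}_M)$,
since that would mean $\|f - g\|^2 = 2 = 2\|f\|^2 = 2\|(I - P)f\|^2$. Otherwise, we take $g = \frac{Pf}{\|Pf\|}$, since

$$f - \frac{Pf}{\|Pf\|} = Pf + (I - P)f - \frac{Pf}{\|Pf\|} = \Big(1 - \frac{1}{\|Pf\|}\Big)Pf + (I - P)f,$$

and so the Pythagorean theorem gives

$$\Big\|f - \frac{Pf}{\|Pf\|}\Big\|^2 = \Big(1 - \frac{1}{\|Pf\|}\Big)^2\|Pf\|^2 + \|(I - P)f\|^2 = 2(1 - \|Pf\|) \leq 2(1 - \|Pf\|^2) = 2\|(I - P)f\|^2. \tag{39}$$

For simplicity, we take $g := \frac{Pf}{\|Pf\|}$, understanding what this means when $Pf = 0$.

Since $F$ is $\varepsilon$-OP, we have $\mathcal{I} \sqcup \mathcal{J} = \{1, \ldots, N\}$ such that $|\langle f_i, f_j\rangle| \leq \varepsilon$ whenever $i \in \mathcal{I}$ and $j \in \mathcal{J}$. Without loss of
generality, we take $|\mathcal{I}| \geq |\mathcal{J}|$. Defining $F_{\mathcal{I}} := \{f_i\}_{i\in\mathcal{I}}$, the frame operator $F_{\mathcal{I}}F_{\mathcal{I}}^*$ has eigenvalues $\{\lambda_{\mathcal{I},m}\}_{m=1}^{M}$, arranged
in decreasing order, and eigenvectors $\{e_{\mathcal{I},m}\}_{m=1}^{M}$. Take $\lambda' := \frac{2N}{3M}(M\varepsilon)^{\frac23}$, and define $M_{\mathcal{I}} := \#\{m : \lambda_{\mathcal{I},m} \geq \lambda'\}$. We know
$M_{\mathcal{I}} \geq 1$, since otherwise

$$\frac{N}{2} \leq |\mathcal{I}| = \mathrm{Tr}(F_{\mathcal{I}}^*F_{\mathcal{I}}) = \mathrm{Tr}(F_{\mathcal{I}}F_{\mathcal{I}}^*) = \sum_{m=1}^{M}\lambda_{\mathcal{I},m} < M\lambda' = \frac{2N}{3}(M\varepsilon)^{\frac23} \leq \frac{2^{\frac13}N}{3} < \frac{N}{2}.$$

Therefore, $P := \sum_{m=1}^{M_{\mathcal{I}}} e_{\mathcal{I},m}e_{\mathcal{I},m}^*$ is a nonzero projection operator on $\mathbb{H}_M$. Moreover,

$$\sum_{i\in\mathcal{I}}\|(I - P)f_i\|^2 = \sum_{i\in\mathcal{I}}\sum_{m=M_{\mathcal{I}}+1}^{M}|\langle f_i, e_{\mathcal{I},m}\rangle|^2 = \sum_{m=M_{\mathcal{I}}+1}^{M}\|F_{\mathcal{I}}^*e_{\mathcal{I},m}\|^2 = \sum_{m=M_{\mathcal{I}}+1}^{M}\langle F_{\mathcal{I}}F_{\mathcal{I}}^*e_{\mathcal{I},m}, e_{\mathcal{I},m}\rangle = \sum_{m=M_{\mathcal{I}}+1}^{M}\lambda_{\mathcal{I},m} \leq M\lambda'. \tag{40}$$

Also, the fact that $e_{\mathcal{I},m}$ is an eigenvector of $F_{\mathcal{I}}F_{\mathcal{I}}^*$ with eigenvalue $\lambda_{\mathcal{I},m}$ gives

$$\sum_{j\in\mathcal{J}}\|Pf_j\|^2 = \sum_{j\in\mathcal{J}}\sum_{m=1}^{M_{\mathcal{I}}}|\langle f_j, e_{\mathcal{I},m}\rangle|^2 = \sum_{j\in\mathcal{J}}\sum_{m=1}^{M_{\mathcal{I}}}\Big|\Big\langle f_j, \frac{1}{\lambda_{\mathcal{I},m}}\sum_{i\in\mathcal{I}}\langle e_{\mathcal{I},m}, f_i\rangle f_i\Big\rangle\Big|^2 \leq \sum_{j\in\mathcal{J}}\sum_{m=1}^{M_{\mathcal{I}}}\frac{1}{\lambda_{\mathcal{I},m}^2}\Big(\sum_{i\in\mathcal{I}}|\langle e_{\mathcal{I},m}, f_i\rangle||\langle f_i, f_j\rangle|\Big)^2.$$

Continuing, we use $|\langle f_i, f_j\rangle| \leq \varepsilon$ and $\lambda_{\mathcal{I},m} \geq \lambda'$:

$$\sum_{j\in\mathcal{J}}\|Pf_j\|^2 \leq \Big(\frac{\varepsilon}{\lambda'}\Big)^2\sum_{j\in\mathcal{J}}\sum_{m=1}^{M_{\mathcal{I}}}\Big(\sum_{i\in\mathcal{I}}|\langle e_{\mathcal{I},m}, f_i\rangle|\Big)^2 \leq \Big(\frac{\varepsilon}{\lambda'}\Big)^2|\mathcal{I}|\sum_{i\in\mathcal{I}}\sum_{j\in\mathcal{J}}\sum_{m=1}^{M_{\mathcal{I}}}|\langle e_{\mathcal{I},m}, f_i\rangle|^2 \leq \Big(\frac{\varepsilon}{\lambda'}\Big)^2|\mathcal{I}|^2|\mathcal{J}| \leq \frac{4N^3\varepsilon^2}{27(\lambda')^2}, \tag{41}$$

where the last inequality comes from $|\mathcal{I}|^2(N - |\mathcal{I}|) \leq \frac{4N^3}{27}$. Define $\tilde{F} = \{\tilde{f}_n\}_{n=1}^{N}$ by $\tilde{f}_n = \frac{Pf_n}{\|Pf_n\|}$ when $n \in \mathcal{I}$, and
$\tilde{f}_n = \frac{(I - P)f_n}{\|(I - P)f_n\|}$ when $n \in \mathcal{J}$. Then, combining (39) with (40) and (41) gives the result:

$$\|F - \tilde{F}\|_{HS}^2 = \sum_{i\in\mathcal{I}}\Big\|f_i - \frac{Pf_i}{\|Pf_i\|}\Big\|^2 + \sum_{j\in\mathcal{J}}\Big\|f_j - \frac{(I - P)f_j}{\|(I - P)f_j\|}\Big\|^2 \leq \sum_{i\in\mathcal{I}}2\|(I - P)f_i\|^2 + \sum_{j\in\mathcal{J}}2\|Pf_j\|^2 \leq 2M\lambda' + \frac{8N^3\varepsilon^2}{27(\lambda')^2} = 2N(M\varepsilon)^{\frac23}. \qquad\Box$$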
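
The construction in this proof can be sketched in a few lines. The code below is an illustration, not the authors' implementation: it assumes real frames, takes the partition index set $\mathcal{I}$ as an input rather than searching for it, uses the threshold $\lambda' = \frac{2N}{3M}(M\varepsilon)^{2/3}$ (the choice for which the two error terms sum to $2N(M\varepsilon)^{2/3}$), and does not handle the degenerate case in which a projected vector vanishes. The name `jump_to_op` is hypothetical.

```python
import numpy as np

def jump_to_op(F, idx_I, eps):
    """Map an eps-OP unit norm frame F (M x N, real) to a nearby OP unit norm frame.

    idx_I lists the column indices in the part I of the partition; the rest form J.
    """
    M, N = F.shape
    idx_I = np.asarray(idx_I)
    idx_J = np.setdiff1d(np.arange(N), idx_I)
    lam_thresh = (2 * N / (3 * M)) * (M * eps) ** (2 / 3)   # threshold lambda'
    evals, evecs = np.linalg.eigh(F[:, idx_I] @ F[:, idx_I].T)
    E = evecs[:, evals >= lam_thresh]      # eigenvectors with eigenvalue >= lambda'
    P = E @ E.T                            # orthogonal projection onto their span
    Q = np.eye(M) - P                      # projection onto the complement
    F_new = np.empty_like(F, dtype=float)
    F_new[:, idx_I] = P @ F[:, idx_I]      # vectors in I move into range(P)
    F_new[:, idx_J] = Q @ F[:, idx_J]      # vectors in J move into range(I - P)
    norms = np.linalg.norm(F_new, axis=0)
    if np.any(norms == 0):
        raise ValueError("degenerate case (a projected vector is zero) is not handled")
    return F_new / norms

# Usage on the frame F(theta) of Example 9 with a small angle.
theta = 0.01
F = np.array([[np.cos(theta), np.cos(theta), 0.0, 0.0],
              [np.sin(theta), -np.sin(theta), 1.0, 1.0]])
eps = np.sin(theta)                        # largest cross inner product between I and J
F_op = jump_to_op(F, idx_I=[0, 1], eps=eps)
```

In this usage example the first two vectors are snapped onto $e_1$ and the last two stay at $e_2$, so the output is exactly OP and lies well within the distance $(2N)^{1/2}(M\varepsilon)^{1/3}$ of the input.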

The previous result tells us how far we must jump in order to transform an $\varepsilon$-OP frame into one that is exactly
OP. This opens the door for the following procedure for producing UNTFs in the non-relatively-prime case: given a
collection of unit norm vectors and fixing any $\varepsilon \in (0, 1]$, perform gradient descent until one's vectors become $\varepsilon$-OP, at
which point jump to an OP frame, and then repeat this procedure on each of the two subframes. In the following result, we
use Theorems 6 and 10 to bound how far this procedure will take us from our original frame.

Theorem 11. Suppose $M$ and $N$ are not relatively prime. Take $F \in \mathbb{S}_M^N$ such that $\|FF^* - \frac{N}{M}I\|_{HS} \leq (2^{21}M^{27}N^{14})^{-1}$.
Then there exists $\tilde{F} \in \mathbb{S}_M^N$, which is either a unit norm tight frame or is orthogonally partitionable, with equal redundancies
in each of the two partitioned subspaces, such that

$$\|F - \tilde{F}\|_{HS} \leq 3M^{\frac67}N^{\frac12}\,\|FF^* - \tfrac{N}{M}I\|_{HS}^{\frac17}. \tag{42}$$

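Proof. Take $t := \frac{1}{4N}$ and $\varepsilon := 2^{\frac32}M^{\frac{11}{7}}\|FF^* - \frac{N}{M}I\|_{HS}^{\frac37}$. According to Theorem 6, gradient descent will converge to
a UNTF, provided iterations never become $\varepsilon$-OP. In this way, we either converge to a UNTF $\tilde{F}$, or produce an $\varepsilon$-OP
frame within $(2N)^{\frac12}(M\varepsilon)^{\frac13}$ of an OP frame $\tilde{F}$, by Theorem 10. Either way, Theorems 6 and 10 give

$$\|F - \tilde{F}\|_{HS} \leq \frac{8M^4N^{\frac12}}{\varepsilon^2}\,\|FF^* - \tfrac{N}{M}I\|_{HS} + (2N)^{\frac12}(M\varepsilon)^{\frac13} = 3M^{\frac67}N^{\frac12}\,\|FF^* - \tfrac{N}{M}I\|_{HS}^{\frac17},$$

which proves (42). Now suppose $\tilde{F}$ is OP. Since

$$\begin{aligned}
|\mathrm{FP}(F) - \mathrm{FP}(\tilde{F})| &= \big|\mathrm{Tr}[(FF^* - \tilde{F}\tilde{F}^*)(FF^* + \tilde{F}\tilde{F}^*)]\big| \\
&\leq \|FF^* - \tilde{F}\tilde{F}^*\|_{HS}\,\|FF^* + \tilde{F}\tilde{F}^*\|_{HS} \\
&\leq \|F - \tilde{F}\|_{HS}\,(\|F\|_{HS} + \|\tilde{F}\|_{HS})(\|F\|_{HS}^2 + \|\tilde{F}\|_{HS}^2),
\end{aligned}$$

we use $\|F\|_{HS}^2 = \|\tilde{F}\|_{HS}^2 = N$ to get $|\mathrm{FP}(F) - \mathrm{FP}(\tilde{F})| \leq 4N^{\frac32}\|F - \tilde{F}\|_{HS}$. Therefore,

$$\mathrm{FP}(\tilde{F}) \leq \mathrm{FP}(F) + |\mathrm{FP}(\tilde{F}) - \mathrm{FP}(F)| = \tfrac{N^2}{M} + \|FF^* - \tfrac{N}{M}I\|_{HS}^2 + |\mathrm{FP}(\tilde{F}) - \mathrm{FP}(F)| \leq \tfrac{N^2}{M} + \|FF^* - \tfrac{N}{M}I\|_{HS}^2 + 4N^{\frac32}\|F - \tilde{F}\|_{HS}.$$

Continuing, we apply (42) and use the fact that $\|FF^* - \frac{N}{M}I\|_{HS}^2 \leq 4N^{\frac32}\big(3M^{\frac67}N^{\frac12}\|FF^* - \frac{N}{M}I\|_{HS}^{\frac17}\big)$:

$$\mathrm{FP}(\tilde{F}) \leq \tfrac{N^2}{M} + \|FF^* - \tfrac{N}{M}I\|_{HS}^2 + 4N^{\frac32}\big(3M^{\frac67}N^{\frac12}\|FF^* - \tfrac{N}{M}I\|_{HS}^{\frac17}\big) \leq \tfrac{N^2}{M} + \frac{24M^{\frac67}N^2}{(2^{21}M^{27}N^{14})^{\frac17}} = \tfrac{N^2}{M} + \frac{3}{M^3}. \tag{43}$$

Since $\tilde{F}$ is OP, there exists an orthogonal partition $\mathcal{I} \sqcup \mathcal{J} = \{1, \ldots, N\}$. Take $M_{\mathcal{I}}$ to be the dimension of the span of
$\{\tilde{f}_i\}_{i\in\mathcal{I}}$, and define $M_{\mathcal{J}}$ similarly. Then,

$$\mathrm{FP}(\tilde{F}) = \mathrm{FP}(\tilde{F}_{\mathcal{I}}) + \mathrm{FP}(\tilde{F}_{\mathcal{J}}) \geq \frac{|\mathcal{I}|^2}{M_{\mathcal{I}}} + \frac{|\mathcal{J}|^2}{M_{\mathcal{J}}} \geq \frac{|\mathcal{I}|^2}{M_{\mathcal{I}}} + \frac{(N - |\mathcal{I}|)^2}{M - M_{\mathcal{I}}} = \frac{N^2}{M} + \frac{(|\mathcal{I}|M - M_{\mathcal{I}}N)^2}{MM_{\mathcal{I}}(M - M_{\mathcal{I}})}.$$

In particular, if $|\mathcal{I}|M - M_{\mathcal{I}}N \neq 0$, then $(|\mathcal{I}|M - M_{\mathcal{I}}N)^2 \geq 1$, and since $M_{\mathcal{I}}(M - M_{\mathcal{I}}) \leq \frac{M^2}{4}$, we would have
$\mathrm{FP}(\tilde{F}) \geq \frac{N^2}{M} + \frac{4}{M^3}$. Considering (43), we may conclude that $|\mathcal{I}|M - M_{\mathcal{I}}N = 0$, and so $\frac{|\mathcal{I}|}{M_{\mathcal{I}}} = \frac{N}{M} = \frac{N - |\mathcal{I}|}{M - M_{\mathcal{I}}}$. $\Box$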



Repeated applications of Theorem 11 will provide solutions, albeit inelegant ones, to the Paulsen problem given
in (3). To elaborate, Theorem 11 states that if a unit norm frame $F$ is sufficiently tight, then there exists a unit norm $\tilde{F}$
such that $\|F - \tilde{F}\|_{HS} = O(\|FF^* - \frac{N}{M}I\|_{HS}^{\frac17})$ which is either a UNTF or is OP into components of equal redundancy. Since
we are done if $\tilde{F}$ happens to be a UNTF, let's focus on the case where $\tilde{F}$ is OP, that is, when $\tilde{F} = \tilde{F}_{\mathcal{I}} \oplus \tilde{F}_{\mathcal{J}}$, where
$\tilde{F}_{\mathcal{I}} = \{\tilde{f}_i\}_{i\in\mathcal{I}}$ and $\tilde{F}_{\mathcal{J}} = \{\tilde{f}_j\}_{j\in\mathcal{J}}$ are frames for some $M_{\mathcal{I}}$- and $M_{\mathcal{J}}$-dimensional subspaces of $\mathbb{H}_M$, respectively, and
$\frac{|\mathcal{I}|}{M_{\mathcal{I}}} = \frac{|\mathcal{J}|}{M_{\mathcal{J}}} = \frac{N}{M}$. We then apply Theorem 11 to $\tilde{F}_{\mathcal{I}}$ and $\tilde{F}_{\mathcal{J}}$: if each is close to a UNTF, these can be directly summed to
form a UNTF which is close to $\tilde{F}$ and in turn, to $F$; if either is OP, we must continue this process in lower-dimensional
subspaces. At most $M$ such nested applications of Theorem 11 are necessary, since each reduces the dimension of the
space in consideration by at least 1. The main issue is that each application of Theorem 11 comes at a terrible cost:
"jumping" from an $\varepsilon$-OP sequence to an OP sequence can increase one's frame potential by a constant multiple of the
jump distance. In particular, with each application of Theorem 11, one's distance from tightness may be effectively
raised to a $\frac17$ power; when one's distance is very small, this exponentiation results in a dramatic increase in distance.
When applied $M$ times in succession, one would therefore expect a net exponent of $\frac{1}{7^M}$. That is, we expect that there
exists an extremely small $\delta > 0$ and an extremely large $C$ for which (3) will hold for $\alpha = \frac{1}{7^M}$. It is unknown whether
such an $M$-dependent $\alpha$ is inherent to this problem, or simply a consequence of a weak argument on our part.

We emphasize that such issues, while of great mathematical interest, should cause little worry in real-world
applications. Indeed, the "perform gradient descent and jump when approaching OP" method that we employed in the
proof of Theorem 11 produces UNTFs which, for all practical purposes, are close to their originals. Nevertheless, the
issue stands: this distance may not be a nice function of the tightness itself. Indeed, this is the heart of the part of 
the Paulsen problem that remains open: "Given a unit norm frame which is extremely close to being tight, and is also 
extremely close to being OP, how far away, as a function of tightness, is the nearest UNTF?" This problem reveals our 
current lack of understanding of the geometry of the set of all UNTFs on very small neighborhoods of OP UNTFs, 
and is more than worthy of additional study. 